osanseviero HF staff commited on
Commit
8a8eb44
1 Parent(s): e7cd942

Update spaCy pipeline

Browse files
LICENSES_SOURCES CHANGED
@@ -1,4 +1,4 @@
1
- # UD French Sequoia v2.5
2
 
3
  * Author: Candito, Marie; Seddah, Djamé; Perrier, Guy; Guillaume, Bruno
4
  * URL: https://github.com/UniversalDependencies/UD_French-Sequoia
 
1
+ # UD French Sequoia v2.8
2
 
3
  * Author: Candito, Marie; Seddah, Djamé; Perrier, Guy; Guillaume, Bruno
4
  * URL: https://github.com/UniversalDependencies/UD_French-Sequoia
README.md CHANGED
@@ -4,7 +4,7 @@ tags:
4
  - token-classification
5
  language:
6
  - fr
7
- license: lgpllr
8
  model-index:
9
  - name: fr_core_news_sm
10
  results:
@@ -14,47 +14,47 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8141885091
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8094952164
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8118350797
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
- value: 0.934480272
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
- value: 0.8642857143
38
  - name: SENTER Recall
39
  type: recall
40
- value: 0.8902232487
41
  - name: SENTER F Score
42
  type: f_score
43
- value: 0.8725961538
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
- value: 0.8787913869
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
- value: 0.8787913869
58
  ---
59
  ### Details: https://spacy.io/models/fr#fr_core_news_sm
60
 
@@ -63,12 +63,12 @@ French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, s
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `fr_core_news_sm` |
66
- | **Version** | `3.1.0` |
67
- | **spaCy** | `>=3.1.0,<3.2.0` |
68
  | **Default Pipeline** | `tok2vec`, `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `tok2vec`, `morphologizer`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
71
- | **Sources** | [UD French Sequoia v2.5](https://github.com/UniversalDependencies/UD_French-Sequoia) (Candito, Marie; Seddah, Djamé; Perrier, Guy; Guillaume, Bruno)<br />[WikiNER](https://figshare.com/articles/Learning_multilingual_named_entity_recognition_from_Wikipedia/5462500) (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)<br />[spaCy lookups data](https://github.com/explosion/spacy-lookups-data) (Explosion) |
72
  | **License** | `LGPL-LR` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
@@ -76,11 +76,11 @@ French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, s
76
 
77
  <details>
78
 
79
- <summary>View label scheme (240 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
- | **`morphologizer`** | `POS=PROPN`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Dem`, `Gender=Fem\|Number=Sing\|POS=NOUN`, `Number=Plur\|POS=PRON\|Person=1`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=SCONJ`, `POS=ADP`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Ord\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=NOUN`, `POS=PUNCT`, `Gender=Masc\|Number=Sing\|POS=PROPN`, `Number=Plur\|POS=ADJ`, `Gender=Masc\|Number=Plur\|POS=NOUN`, `Definite=Ind\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Number=Sing\|POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `POS=ADV`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=PROPN`, `Definite=Def\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Card\|POS=NUM`, `Definite=Def\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=ADJ`, `POS=CCONJ`, `Gender=Fem\|Number=Plur\|POS=NOUN`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=ADJ`, `POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `POS=PRON\|PronType=Rel`, `Number=Sing\|POS=DET\|Poss=Yes`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Number=Plur\|POS=ADP\|PronType=Art`, `Definite=Ind\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=VERB\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3`, `Number=Plur\|POS=DET`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Dem`, `POS=ADV\|PronType=Int`, `POS=VERB\|Tense=Pres\|VerbForm=Part`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Number=Plur\|POS=DET\|Poss=Yes`, `POS=AUX\|VerbForm=Inf`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Masc\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=ADV\|Polarity=Neg`, `Definite=Ind\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3`, `POS=PRON\|Person=3\|Reflex=Yes`, `Gender=Masc\|POS=NOUN`, `POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=PRON\|Person=3`, `Number=Plur\|POS=NOUN`, `NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=AUX\|Tense=Pres\|VerbForm=Part`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|Person=3`, `Number=Sing\|POS=NOUN`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Plur\|POS=PROPN`, `Number=Sing\|POS=PROPN`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Dem`, `Gender=Masc\|Number=Sing\|POS=DET`, `Gender=Fem\|Number=Sing\|POS=DET\|Poss=Yes`, `Gender=Masc\|POS=PRON`, `POS=NOUN`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON`, `Gender=Masc\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Number=Sing\|POS=PRON`, `Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Ind\|POS=VERB\|VerbForm=Fin`, `Number=Plur\|POS=DET\|PronType=Dem`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=PRON`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Dem`, `Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=PRON`, `POS=NUM`, `Gender=Fem\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=PRON`, `Number=Plur\|POS=PRON\|Person=3`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|Person=1`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=INTJ`, `Number=Plur\|POS=PRON\|Person=2`, `NumType=Card\|POS=PRON`, `Definite=Ind\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `NumType=Card\|POS=NOUN`, `POS=PRON\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3`, `Gender=Fem\|Number=Sing\|POS=DET`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Ind\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Dem`, `Gender=Masc\|Number=Plur\|POS=PROPN`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Dem`, `Number=Sing\|POS=DET`, `Gender=Masc\|NumType=Card\|Number=Plur\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Dem`, `Mood=Ind\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|POS=PRON`, `Gender=Masc\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=X`, `POS=SYM`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Int`, `POS=DET`, `Gender=Masc\|Number=Plur\|POS=PRON`, `POS=PART`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Person=3\|VerbForm=Fin`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Masc\|Number=Plur\|POS=DET`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Rel`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Rel`, `POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Fem\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Imp\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|Reflex=Yes`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=1\|Reflex=Yes`, `Gender=Masc\|NumType=Card\|Number=Sing\|POS=NOUN`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Number=Sing\|POS=PRON\|Person=1\|Reflex=Yes`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|POS=PROPN`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|POS=ADV`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=PROPN`, `Gender=Masc\|NumType=Card\|POS=NUM` |
84
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux:pass`, `aux:tense`, `case`, `cc`, `ccomp`, `conj`, `cop`, `dep`, `det`, `expl:comp`, `expl:pass`, `expl:subj`, `fixed`, `flat:foreign`, `flat:name`, `iobj`, `mark`, `nmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl:agent`, `obl:arg`, `obl:mod`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
  | **`senter`** | `I`, `S` |
86
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |
@@ -92,15 +92,21 @@ French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, s
92
  | Type | Score |
93
  | --- | --- |
94
  | `TOKEN_ACC` | 99.90 |
95
- | `TAG_ACC` | 93.45 |
96
- | `POS_ACC` | 96.32 |
97
- | `MORPH_ACC` | 95.27 |
98
- | `LEMMA_ACC` | 90.33 |
99
- | `DEP_UAS` | 87.88 |
100
- | `DEP_LAS` | 83.82 |
101
- | `SENTS_P` | 86.43 |
102
- | `SENTS_R` | 89.02 |
103
- | `SENTS_F` | 87.26 |
104
- | `ENTS_P` | 81.42 |
105
- | `ENTS_R` | 80.95 |
106
- | `ENTS_F` | 81.18 |
 
 
 
 
 
 
 
4
  - token-classification
5
  language:
6
  - fr
7
+ license: lgpl-lr
8
  model-index:
9
  - name: fr_core_news_sm
10
  results:
 
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.8121504727
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8080541211
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.8100971185
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
  - name: POS Accuracy
29
  type: accuracy
30
+ value: 0.9312032981
31
  - task:
32
  name: SENTER
33
  type: token-classification
34
  metrics:
35
  - name: SENTER Precision
36
  type: precision
37
+ value: 0.8658823529
38
  - name: SENTER Recall
39
  type: recall
40
+ value: 0.8932038835
41
  - name: SENTER F Score
42
  type: f_score
43
+ value: 0.8793309438
44
  - task:
45
  name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
  - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
+ value: 0.8770041095
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
  - name: Labeled Dependencies Accuracy
56
  type: accuracy
57
+ value: 0.8770041095
58
  ---
59
  ### Details: https://spacy.io/models/fr#fr_core_news_sm
60
 
 
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `fr_core_news_sm` |
66
+ | **Version** | `3.2.0` |
67
+ | **spaCy** | `>=3.2.0,<3.3.0` |
68
  | **Default Pipeline** | `tok2vec`, `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `tok2vec`, `morphologizer`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
71
+ | **Sources** | [UD French Sequoia v2.8](https://github.com/UniversalDependencies/UD_French-Sequoia) (Candito, Marie; Seddah, Djamé; Perrier, Guy; Guillaume, Bruno)<br />[WikiNER](https://figshare.com/articles/Learning_multilingual_named_entity_recognition_from_Wikipedia/5462500) (Joel Nothman, Nicky Ringland, Will Radford, Tara Murphy, James R Curran)<br />[spaCy lookups data](https://github.com/explosion/spacy-lookups-data) (Explosion) |
72
  | **License** | `LGPL-LR` |
73
  | **Author** | [Explosion](https://explosion.ai) |
74
 
 
76
 
77
  <details>
78
 
79
+ <summary>View label scheme (238 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
+ | **`morphologizer`** | `POS=PROPN`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Dem`, `Gender=Fem\|Number=Sing\|POS=NOUN`, `Number=Plur\|POS=PRON\|Person=1`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=SCONJ`, `POS=ADP`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Ord\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=NOUN`, `POS=PUNCT`, `Gender=Masc\|Number=Sing\|POS=PROPN`, `Number=Plur\|POS=ADJ`, `Gender=Masc\|Number=Plur\|POS=NOUN`, `Definite=Ind\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Number=Sing\|POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `POS=ADV`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=PROPN`, `Definite=Def\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Card\|POS=NUM`, `Definite=Def\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=ADJ`, `POS=CCONJ`, `Gender=Fem\|Number=Plur\|POS=NOUN`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=ADJ`, `POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `POS=PRON\|PronType=Rel`, `Number=Sing\|POS=DET\|Poss=Yes`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Number=Plur\|POS=ADP\|PronType=Art`, `Definite=Ind\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=VERB\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3`, `Number=Plur\|POS=DET`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Dem`, `POS=ADV\|PronType=Int`, `POS=VERB\|Tense=Pres\|VerbForm=Part`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Number=Plur\|POS=DET\|Poss=Yes`, `POS=AUX\|VerbForm=Inf`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Masc\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=ADV\|Polarity=Neg`, `Definite=Ind\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3`, `POS=PRON\|Person=3\|Reflex=Yes`, `Gender=Masc\|POS=NOUN`, `POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=PRON\|Person=3`, `Number=Plur\|POS=NOUN`, `NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=AUX\|Tense=Pres\|VerbForm=Part`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|Person=3`, `Number=Sing\|POS=NOUN`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Plur\|POS=PROPN`, `Number=Sing\|POS=PROPN`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Dem`, `Gender=Masc\|Number=Sing\|POS=DET`, `Gender=Fem\|Number=Sing\|POS=DET\|Poss=Yes`, `Gender=Masc\|POS=PRON`, `POS=NOUN`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON`, `Gender=Masc\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Number=Sing\|POS=PRON`, `Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Ind\|POS=VERB\|VerbForm=Fin`, `Number=Plur\|POS=DET\|PronType=Dem`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=PRON`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Dem`, `Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=PRON`, `POS=NUM`, `Gender=Fem\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=PRON`, `Number=Plur\|POS=PRON\|Person=3`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|Person=1`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=INTJ`, `Number=Plur\|POS=PRON\|Person=2`, `NumType=Card\|POS=PRON`, `Definite=Ind\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `NumType=Card\|POS=NOUN`, `POS=PRON\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3`, `Gender=Fem\|Number=Sing\|POS=DET`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Ind\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Dem`, `Gender=Masc\|Number=Plur\|POS=PROPN`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Dem`, `Number=Sing\|POS=DET`, `Gender=Masc\|NumType=Card\|Number=Plur\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Dem`, `Mood=Ind\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|POS=PRON`, `Gender=Masc\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=X`, `POS=SYM`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Int`, `POS=DET`, `Gender=Masc\|Number=Plur\|POS=PRON`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Person=3\|VerbForm=Fin`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Masc\|Number=Plur\|POS=DET`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Rel`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Rel`, `POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Fem\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Imp\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|Reflex=Yes`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=1\|Reflex=Yes`, `Gender=Masc\|NumType=Card\|Number=Sing\|POS=NOUN`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Number=Sing\|POS=PRON\|Person=1\|Reflex=Yes`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|POS=PROPN`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=PROPN`, `Gender=Masc\|NumType=Card\|POS=NUM` |
84
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux:pass`, `aux:tense`, `case`, `cc`, `ccomp`, `conj`, `cop`, `dep`, `det`, `expl:comp`, `expl:pass`, `expl:subj`, `fixed`, `flat:foreign`, `flat:name`, `iobj`, `mark`, `nmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl:agent`, `obl:arg`, `obl:mod`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
  | **`senter`** | `I`, `S` |
86
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |
 
92
  | Type | Score |
93
  | --- | --- |
94
  | `TOKEN_ACC` | 99.90 |
95
+ | `TOKEN_P` | 98.44 |
96
+ | `TOKEN_R` | 98.96 |
97
+ | `TOKEN_F` | 98.70 |
98
+ | `POS_ACC` | 96.01 |
99
+ | `MORPH_ACC` | 94.93 |
100
+ | `MORPH_MICRO_P` | 97.66 |
101
+ | `MORPH_MICRO_R` | 96.48 |
102
+ | `MORPH_MICRO_F` | 97.07 |
103
+ | `SENTS_P` | 86.59 |
104
+ | `SENTS_R` | 89.32 |
105
+ | `SENTS_F` | 87.93 |
106
+ | `DEP_UAS` | 87.70 |
107
+ | `DEP_LAS` | 83.26 |
108
+ | `TAG_ACC` | 93.12 |
109
+ | `LEMMA_ACC` | 90.31 |
110
+ | `ENTS_P` | 81.22 |
111
+ | `ENTS_R` | 80.81 |
112
+ | `ENTS_F` | 81.01 |
accuracy.json CHANGED
@@ -1,70 +1,68 @@
1
  {
2
  "token_acc": 0.9989751998,
3
- "tag_acc": 0.934480272,
4
- "pos_acc": 0.9632224168,
5
- "morph_acc": 0.952690167,
6
- "lemma_acc": 0.9032722042,
7
- "dep_uas": 0.8787913869,
8
- "dep_las": 0.8382242287,
9
- "sents_p": 0.8642857143,
10
- "sents_r": 0.8902232487,
11
- "sents_f": 0.8725961538,
12
- "speed": 5408.6948387533,
13
  "morph_per_feat": {
14
  "Definite": {
15
- "p": 0.9853907962,
16
- "r": 0.9839533187,
17
- "f": 0.9846715328
18
  },
19
  "Number": {
20
- "p": 0.9923163418,
21
- "r": 0.9812824314,
22
- "f": 0.9867685427
23
  },
24
  "PronType": {
25
- "p": 0.99356085,
26
- "r": 0.98657289,
27
- "f": 0.9900545396
28
  },
29
  "Gender": {
30
- "p": 0.9726847034,
31
- "r": 0.9661498708,
32
- "f": 0.9694062743
33
  },
34
  "Mood": {
35
- "p": 0.9670932358,
36
- "r": 0.9396092362,
37
- "f": 0.9531531532
38
  },
39
  "Person": {
40
- "p": 0.9805950841,
41
- "r": 0.9522613065,
42
- "f": 0.9662205226
43
  },
44
  "Tense": {
45
- "p": 0.9441571872,
46
- "r": 0.9325842697,
47
- "f": 0.9383350462
48
  },
49
  "VerbForm": {
50
- "p": 0.9672818792,
51
- "r": 0.9544701987,
52
- "f": 0.9608333333
53
  },
54
  "NumType": {
55
- "p": 0.9857651246,
56
- "r": 0.9551724138,
57
- "f": 0.9702276708
58
  },
59
  "Reflex": {
60
- "p": 0.9777777778,
61
  "r": 1.0,
62
- "f": 0.9887640449
63
  },
64
  "Voice": {
65
- "p": 0.8833333333,
66
- "r": 0.9464285714,
67
- "f": 0.9137931034
68
  },
69
  "Poss": {
70
  "p": 1.0,
@@ -72,141 +70,146 @@
72
  "f": 1.0
73
  },
74
  "Polarity": {
75
- "p": 0.9882352941,
76
- "r": 0.9882352941,
77
- "f": 0.9882352941
78
  }
79
  },
 
 
 
 
 
80
  "dep_las_per_type": {
81
  "det": {
82
- "p": 0.9746938776,
83
- "r": 0.9636803874,
84
- "f": 0.9691558442
85
  },
86
  "nsubj": {
87
- "p": 0.8484848485,
88
- "r": 0.8096385542,
89
- "f": 0.8286066584
90
  },
91
  "aux:tense": {
92
- "p": 0.9268292683,
93
- "r": 0.912,
94
- "f": 0.9193548387
95
  },
96
  "root": {
97
- "p": 0.8392434988,
98
- "r": 0.8616504854,
99
- "f": 0.8502994012
100
  },
101
  "obj": {
102
- "p": 0.8373493976,
103
- "r": 0.824925816,
104
- "f": 0.8310911809
105
  },
106
  "cc": {
107
- "p": 0.871559633,
108
- "r": 0.8755760369,
109
- "f": 0.8735632184
110
  },
111
  "case": {
112
- "p": 0.9602960969,
113
- "r": 0.9720708447,
114
- "f": 0.9661475965
115
  },
116
  "obl:mod": {
117
- "p": 0.6355140187,
118
- "r": 0.6071428571,
119
- "f": 0.6210045662
120
  },
121
  "nmod": {
122
- "p": 0.790368272,
123
- "r": 0.8353293413,
124
- "f": 0.8122270742
125
  },
126
  "conj": {
127
- "p": 0.52,
128
- "r": 0.5118110236,
129
- "f": 0.5158730159
130
  },
131
  "nummod": {
132
- "p": 0.93125,
133
- "r": 0.8869047619,
134
- "f": 0.9085365854
135
  },
136
  "amod": {
137
- "p": 0.8818181818,
138
- "r": 0.8850364964,
139
- "f": 0.883424408
140
  },
141
  "acl": {
142
- "p": 0.6781609195,
143
- "r": 0.6820809249,
144
- "f": 0.6801152738
145
  },
146
  "mark": {
147
- "p": 0.8755760369,
148
- "r": 0.8370044053,
149
- "f": 0.8558558559
150
  },
151
  "xcomp": {
152
- "p": 0.8285714286,
153
- "r": 0.7682119205,
154
- "f": 0.7972508591
155
  },
156
  "flat:name": {
157
- "p": 0.9223300971,
158
  "r": 0.9047619048,
159
- "f": 0.9134615385
160
  },
161
  "cop": {
162
- "p": 0.808988764,
163
  "r": 0.8,
164
- "f": 0.8044692737
165
  },
166
  "advmod": {
167
- "p": 0.8121019108,
168
- "r": 0.7993730408,
169
- "f": 0.8056872038
170
  },
171
  "obl:arg": {
172
- "p": 0.6714975845,
173
- "r": 0.6318181818,
174
- "f": 0.6510538642
175
  },
176
  "appos": {
177
- "p": 0.4831460674,
178
- "r": 0.5180722892,
179
- "f": 0.5
180
  },
181
  "nsubj:pass": {
182
- "p": 0.8295454545,
183
- "r": 0.8588235294,
184
- "f": 0.8439306358
185
  },
186
  "aux:pass": {
187
- "p": 0.905982906,
188
  "r": 0.9464285714,
189
- "f": 0.9257641921
190
  },
191
  "acl:relcl": {
192
- "p": 0.6506024096,
193
- "r": 0.6279069767,
194
- "f": 0.6390532544
195
  },
196
  "advcl": {
197
- "p": 0.4736842105,
198
- "r": 0.4615384615,
199
- "f": 0.4675324675
200
  },
201
  "fixed": {
202
- "p": 0.8351648352,
203
- "r": 0.7524752475,
204
- "f": 0.7916666667
205
  },
206
  "dep": {
207
- "p": 0.3111111111,
208
- "r": 0.4516129032,
209
- "f": 0.3684210526
210
  },
211
  "expl:subj": {
212
  "p": 0.7058823529,
@@ -214,34 +217,34 @@
214
  "f": 0.7272727273
215
  },
216
  "expl:comp": {
217
- "p": 0.725,
218
- "r": 0.9666666667,
219
- "f": 0.8285714286
220
  },
221
  "expl:pass": {
222
- "p": 0.6,
223
- "r": 0.4285714286,
224
- "f": 0.5
 
 
 
 
 
225
  },
226
  "ccomp": {
227
- "p": 0.6296296296,
228
- "r": 0.6666666667,
229
- "f": 0.6476190476
230
  },
231
  "parataxis": {
232
- "p": 0.4333333333,
233
- "r": 0.4642857143,
234
- "f": 0.4482758621
235
  },
236
  "iobj": {
237
- "p": 0.7333333333,
238
- "r": 0.44,
239
- "f": 0.55
240
- },
241
- "obl:agent": {
242
- "p": 0.9459459459,
243
- "r": 0.8333333333,
244
- "f": 0.8860759494
245
  },
246
  "nsubj:caus": {
247
  "p": 0.0,
@@ -264,9 +267,9 @@
264
  "f": 0.0
265
  },
266
  "vocative": {
267
- "p": 0.8333333333,
268
  "r": 0.625,
269
- "f": 0.7142857143
270
  },
271
  "dislocated": {
272
  "p": 0.0,
@@ -274,9 +277,9 @@
274
  "f": 0.0
275
  },
276
  "flat:foreign": {
277
- "p": 1.0,
278
- "r": 0.1428571429,
279
- "f": 0.25
280
  },
281
  "orphan": {
282
  "p": 0.0,
@@ -294,29 +297,32 @@
294
  "f": 0.0
295
  }
296
  },
297
- "ents_p": 0.8141885091,
298
- "ents_r": 0.8094952164,
299
- "ents_f": 0.8118350797,
 
 
300
  "ents_per_type": {
301
  "PER": {
302
- "p": 0.870535084,
303
- "r": 0.883570968,
304
- "f": 0.877004587
305
  },
306
  "LOC": {
307
- "p": 0.8215006799,
308
- "r": 0.8339747344,
309
- "f": 0.8276907109
310
  },
311
  "ORG": {
312
- "p": 0.7624722278,
313
- "r": 0.7204198473,
314
- "f": 0.7408497694
315
  },
316
  "MISC": {
317
- "p": 0.6997455471,
318
- "r": 0.641492929,
319
- "f": 0.6693542253
320
  }
321
- }
 
322
  }
 
1
  {
2
  "token_acc": 0.9989751998,
3
+ "token_p": 0.9844389844,
4
+ "token_r": 0.9896058454,
5
+ "token_f": 0.9870156531,
6
+ "pos_acc": 0.9600618397,
7
+ "morph_acc": 0.9492783505,
8
+ "morph_micro_p": 0.9765677992,
9
+ "morph_micro_r": 0.9648470818,
10
+ "morph_micro_f": 0.9706720603,
 
 
11
  "morph_per_feat": {
12
  "Definite": {
13
+ "p": 0.986100951,
14
+ "r": 0.9839416058,
15
+ "f": 0.985020095
16
  },
17
  "Number": {
18
+ "p": 0.9906838085,
19
+ "r": 0.9788291605,
20
+ "f": 0.9847208075
21
  },
22
  "PronType": {
23
+ "p": 0.992916935,
24
+ "r": 0.9865642994,
25
+ "f": 0.9897304236
26
  },
27
  "Gender": {
28
+ "p": 0.9705502454,
29
+ "r": 0.9601328904,
30
+ "f": 0.9653134635
31
  },
32
  "Mood": {
33
+ "p": 0.9575645756,
34
+ "r": 0.9218472469,
35
+ "f": 0.9393665158
36
  },
37
  "Person": {
38
+ "p": 0.9726205997,
39
+ "r": 0.9383647799,
40
+ "f": 0.9551856594
41
  },
42
  "Tense": {
43
+ "p": 0.936918304,
44
+ "r": 0.9254341164,
45
+ "f": 0.9311408016
46
  },
47
  "VerbForm": {
48
+ "p": 0.9538977368,
49
+ "r": 0.9420529801,
50
+ "f": 0.947938359
51
  },
52
  "NumType": {
53
+ "p": 0.9858156028,
54
+ "r": 0.9488054608,
55
+ "f": 0.9669565217
56
  },
57
  "Reflex": {
58
+ "p": 0.9565217391,
59
  "r": 1.0,
60
+ "f": 0.9777777778
61
  },
62
  "Voice": {
63
+ "p": 0.8429752066,
64
+ "r": 0.9107142857,
65
+ "f": 0.8755364807
66
  },
67
  "Poss": {
68
  "p": 1.0,
 
70
  "f": 1.0
71
  },
72
  "Polarity": {
73
+ "p": 0.9880952381,
74
+ "r": 0.9764705882,
75
+ "f": 0.9822485207
76
  }
77
  },
78
+ "sents_p": 0.8658823529,
79
+ "sents_r": 0.8932038835,
80
+ "sents_f": 0.8793309438,
81
+ "dep_uas": 0.8770041095,
82
+ "dep_las": 0.832561907,
83
  "dep_las_per_type": {
84
  "det": {
85
+ "p": 0.9724919094,
86
+ "r": 0.9701372074,
87
+ "f": 0.9713131313
88
  },
89
  "nsubj": {
90
+ "p": 0.8618925831,
91
+ "r": 0.8120481928,
92
+ "f": 0.8362282878
93
  },
94
  "aux:tense": {
95
+ "p": 0.9206349206,
96
+ "r": 0.928,
97
+ "f": 0.9243027888
98
  },
99
  "root": {
100
+ "p": 0.853427896,
101
+ "r": 0.8762135922,
102
+ "f": 0.8646706587
103
  },
104
  "obj": {
105
+ "p": 0.8171091445,
106
+ "r": 0.821958457,
107
+ "f": 0.8195266272
108
  },
109
  "cc": {
110
+ "p": 0.869955157,
111
+ "r": 0.8940092166,
112
+ "f": 0.8818181818
113
  },
114
  "case": {
115
+ "p": 0.9600811908,
116
+ "r": 0.9666212534,
117
+ "f": 0.9633401222
118
  },
119
  "obl:mod": {
120
+ "p": 0.6214511041,
121
+ "r": 0.5880597015,
122
+ "f": 0.6042944785
123
  },
124
  "nmod": {
125
+ "p": 0.7838095238,
126
+ "r": 0.8221778222,
127
+ "f": 0.8025353486
128
  },
129
  "conj": {
130
+ "p": 0.5307692308,
131
+ "r": 0.5433070866,
132
+ "f": 0.5369649805
133
  },
134
  "nummod": {
135
+ "p": 0.9210526316,
136
+ "r": 0.8284023669,
137
+ "f": 0.8722741433
138
  },
139
  "amod": {
140
+ "p": 0.8683729433,
141
+ "r": 0.8652094718,
142
+ "f": 0.8667883212
143
  },
144
  "acl": {
145
+ "p": 0.6411764706,
146
+ "r": 0.6300578035,
147
+ "f": 0.6355685131
148
  },
149
  "mark": {
150
+ "p": 0.9052132701,
151
+ "r": 0.8414096916,
152
+ "f": 0.8721461187
153
  },
154
  "xcomp": {
155
+ "p": 0.8,
156
+ "r": 0.7947019868,
157
+ "f": 0.7973421927
158
  },
159
  "flat:name": {
160
+ "p": 0.8482142857,
161
  "r": 0.9047619048,
162
+ "f": 0.8755760369
163
  },
164
  "cop": {
165
+ "p": 0.8571428571,
166
  "r": 0.8,
167
+ "f": 0.8275862069
168
  },
169
  "advmod": {
170
+ "p": 0.8338658147,
171
+ "r": 0.8181818182,
172
+ "f": 0.8259493671
173
  },
174
  "obl:arg": {
175
+ "p": 0.6553398058,
176
+ "r": 0.6136363636,
177
+ "f": 0.6338028169
178
  },
179
  "appos": {
180
+ "p": 0.417721519,
181
+ "r": 0.3975903614,
182
+ "f": 0.4074074074
183
  },
184
  "nsubj:pass": {
185
+ "p": 0.7717391304,
186
+ "r": 0.8352941176,
187
+ "f": 0.802259887
188
  },
189
  "aux:pass": {
190
+ "p": 0.9137931034,
191
  "r": 0.9464285714,
192
+ "f": 0.9298245614
193
  },
194
  "acl:relcl": {
195
+ "p": 0.5714285714,
196
+ "r": 0.6046511628,
197
+ "f": 0.5875706215
198
  },
199
  "advcl": {
200
+ "p": 0.4929577465,
201
+ "r": 0.4487179487,
202
+ "f": 0.4697986577
203
  },
204
  "fixed": {
205
+ "p": 0.691588785,
206
+ "r": 0.74,
207
+ "f": 0.7149758454
208
  },
209
  "dep": {
210
+ "p": 0.2884615385,
211
+ "r": 0.5172413793,
212
+ "f": 0.3703703704
213
  },
214
  "expl:subj": {
215
  "p": 0.7058823529,
 
217
  "f": 0.7272727273
218
  },
219
  "expl:comp": {
220
+ "p": 0.7428571429,
221
+ "r": 0.8666666667,
222
+ "f": 0.8
223
  },
224
  "expl:pass": {
225
+ "p": 0.4,
226
+ "r": 0.2857142857,
227
+ "f": 0.3333333333
228
+ },
229
+ "obl:agent": {
230
+ "p": 0.8205128205,
231
+ "r": 0.7619047619,
232
+ "f": 0.7901234568
233
  },
234
  "ccomp": {
235
+ "p": 0.6603773585,
236
+ "r": 0.6862745098,
237
+ "f": 0.6730769231
238
  },
239
  "parataxis": {
240
+ "p": 0.36,
241
+ "r": 0.3214285714,
242
+ "f": 0.3396226415
243
  },
244
  "iobj": {
245
+ "p": 0.7,
246
+ "r": 0.56,
247
+ "f": 0.6222222222
 
 
 
 
 
248
  },
249
  "nsubj:caus": {
250
  "p": 0.0,
 
267
  "f": 0.0
268
  },
269
  "vocative": {
270
+ "p": 1.0,
271
  "r": 0.625,
272
+ "f": 0.7692307692
273
  },
274
  "dislocated": {
275
  "p": 0.0,
 
277
  "f": 0.0
278
  },
279
  "flat:foreign": {
280
+ "p": 0.0,
281
+ "r": 0.0,
282
+ "f": 0.0
283
  },
284
  "orphan": {
285
  "p": 0.0,
 
297
  "f": 0.0
298
  }
299
  },
300
+ "tag_acc": 0.9312032981,
301
+ "lemma_acc": 0.9031031648,
302
+ "ents_p": 0.8121504727,
303
+ "ents_r": 0.8080541211,
304
+ "ents_f": 0.8100971185,
305
  "ents_per_type": {
306
  "PER": {
307
+ "p": 0.8685030449,
308
+ "r": 0.8787705094,
309
+ "f": 0.8736066099
310
  },
311
  "LOC": {
312
+ "p": 0.8245838668,
313
+ "r": 0.835104158,
314
+ "f": 0.8298106698
315
  },
316
  "ORG": {
317
+ "p": 0.7541699762,
318
+ "r": 0.7248091603,
319
+ "f": 0.7391981316
320
  },
321
  "MISC": {
322
+ "p": 0.6852231509,
323
+ "r": 0.6334742674,
324
+ "f": 0.6583333333
325
  }
326
+ },
327
+ "speed": 4222.2093213177
328
  }
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
 
config.cfg CHANGED
@@ -1,10 +1,8 @@
1
  [paths]
2
- train = "corpus/fr-dep-news/train.spacy"
3
- dev = "corpus/fr-dep-news/dev.spacy"
4
  vectors = null
5
- raw = null
6
  init_tok2vec = null
7
- vocab_data = null
8
 
9
  [system]
10
  gpu_allocator = null
@@ -24,6 +22,7 @@ tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
24
 
25
  [components.attribute_ruler]
26
  factory = "attribute_ruler"
 
27
  validate = false
28
 
29
  [components.lemmatizer]
@@ -31,9 +30,13 @@ factory = "lemmatizer"
31
  mode = "rule"
32
  model = null
33
  overwrite = false
 
34
 
35
  [components.morphologizer]
36
  factory = "morphologizer"
 
 
 
37
 
38
  [components.morphologizer.model]
39
  @architectures = "spacy.Tagger.v1"
@@ -48,6 +51,7 @@ upstream = "tok2vec"
48
  factory = "ner"
49
  incorrect_spans_key = null
50
  moves = null
 
51
  update_with_oracle_cut_size = 100
52
 
53
  [components.ner.model]
@@ -65,8 +69,8 @@ nO = null
65
  [components.ner.model.tok2vec.embed]
66
  @architectures = "spacy.MultiHashEmbed.v2"
67
  width = 96
68
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
69
- rows = [5000,2500,2500,2500]
70
  include_static_vectors = false
71
 
72
  [components.ner.model.tok2vec.encode]
@@ -81,6 +85,7 @@ factory = "parser"
81
  learn_tokens = false
82
  min_action_freq = 30
83
  moves = null
 
84
  update_with_oracle_cut_size = 100
85
 
86
  [components.parser.model]
@@ -99,6 +104,8 @@ upstream = "tok2vec"
99
 
100
  [components.senter]
101
  factory = "senter"
 
 
102
 
103
  [components.senter.model]
104
  @architectures = "spacy.Tagger.v1"
@@ -110,8 +117,8 @@ nO = null
110
  [components.senter.model.tok2vec.embed]
111
  @architectures = "spacy.MultiHashEmbed.v2"
112
  width = 16
113
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
114
- rows = [1000,500,500,500]
115
  include_static_vectors = false
116
 
117
  [components.senter.model.tok2vec.encode]
@@ -130,8 +137,8 @@ factory = "tok2vec"
130
  [components.tok2vec.model.embed]
131
  @architectures = "spacy.MultiHashEmbed.v2"
132
  width = ${components.tok2vec.model.encode:width}
133
- attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
134
- rows = [5000,2500,2500,2500]
135
  include_static_vectors = false
136
 
137
  [components.tok2vec.model.encode]
@@ -145,22 +152,19 @@ maxout_pieces = 3
145
 
146
  [corpora.dev]
147
  @readers = "spacy.Corpus.v1"
148
- limit = 0
149
- max_length = 0
150
- path = ${paths:dev}
151
  gold_preproc = false
 
 
152
  augmenter = null
153
 
154
  [corpora.train]
155
  @readers = "spacy.Corpus.v1"
156
- path = ${paths:train}
157
- max_length = 5000
158
  gold_preproc = false
 
159
  limit = 0
160
-
161
- [corpora.train.augmenter]
162
- @augmenters = "spacy.lower_case.v1"
163
- level = 0.1
164
 
165
  [training]
166
  train_corpus = "corpora.train"
@@ -191,9 +195,8 @@ compound = 1.001
191
  t = 0.0
192
 
193
  [training.logger]
194
- @loggers = "spacy.WandbLogger.v1"
195
- project_name = "spacy-v3.0.0a2"
196
- remove_config_values = []
197
 
198
  [training.optimizer]
199
  @optimizers = "Adam.v1"
@@ -216,16 +219,17 @@ dep_las_per_type = null
216
  sents_p = null
217
  sents_r = null
218
  sents_f = 0.02
219
- lemma_acc = 0.33
220
- ents_f = 0.33
221
  ents_p = 0.0
222
  ents_r = 0.0
223
  ents_per_type = null
 
224
 
225
  [pretraining]
226
 
227
  [initialize]
228
- vocab_data = ${paths.vocab_data}
229
  vectors = ${paths.vectors}
230
  init_tok2vec = ${paths.init_tok2vec}
231
  before_init = null
 
1
  [paths]
2
+ train = null
3
+ dev = null
4
  vectors = null
 
5
  init_tok2vec = null
 
6
 
7
  [system]
8
  gpu_allocator = null
 
22
 
23
  [components.attribute_ruler]
24
  factory = "attribute_ruler"
25
+ scorer = {"@scorers":"spacy.attribute_ruler_scorer.v1"}
26
  validate = false
27
 
28
  [components.lemmatizer]
 
30
  mode = "rule"
31
  model = null
32
  overwrite = false
33
+ scorer = {"@scorers":"spacy.lemmatizer_scorer.v1"}
34
 
35
  [components.morphologizer]
36
  factory = "morphologizer"
37
+ extend = false
38
+ overwrite = true
39
+ scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
40
 
41
  [components.morphologizer.model]
42
  @architectures = "spacy.Tagger.v1"
 
51
  factory = "ner"
52
  incorrect_spans_key = null
53
  moves = null
54
+ scorer = {"@scorers":"spacy.ner_scorer.v1"}
55
  update_with_oracle_cut_size = 100
56
 
57
  [components.ner.model]
 
69
  [components.ner.model.tok2vec.embed]
70
  @architectures = "spacy.MultiHashEmbed.v2"
71
  width = 96
72
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
73
+ rows = [5000,2500,2500,2500,100]
74
  include_static_vectors = false
75
 
76
  [components.ner.model.tok2vec.encode]
 
85
  learn_tokens = false
86
  min_action_freq = 30
87
  moves = null
88
+ scorer = {"@scorers":"spacy.parser_scorer.v1"}
89
  update_with_oracle_cut_size = 100
90
 
91
  [components.parser.model]
 
104
 
105
  [components.senter]
106
  factory = "senter"
107
+ overwrite = false
108
+ scorer = {"@scorers":"spacy.senter_scorer.v1"}
109
 
110
  [components.senter.model]
111
  @architectures = "spacy.Tagger.v1"
 
117
  [components.senter.model.tok2vec.embed]
118
  @architectures = "spacy.MultiHashEmbed.v2"
119
  width = 16
120
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
121
+ rows = [1000,500,500,500,50]
122
  include_static_vectors = false
123
 
124
  [components.senter.model.tok2vec.encode]
 
137
  [components.tok2vec.model.embed]
138
  @architectures = "spacy.MultiHashEmbed.v2"
139
  width = ${components.tok2vec.model.encode:width}
140
+ attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
141
+ rows = [5000,2500,2500,2500,100]
142
  include_static_vectors = false
143
 
144
  [components.tok2vec.model.encode]
 
152
 
153
  [corpora.dev]
154
  @readers = "spacy.Corpus.v1"
155
+ path = ${paths.dev}
 
 
156
  gold_preproc = false
157
+ max_length = 0
158
+ limit = 0
159
  augmenter = null
160
 
161
  [corpora.train]
162
  @readers = "spacy.Corpus.v1"
163
+ path = ${paths.train}
 
164
  gold_preproc = false
165
+ max_length = 0
166
  limit = 0
167
+ augmenter = null
 
 
 
168
 
169
  [training]
170
  train_corpus = "corpora.train"
 
195
  t = 0.0
196
 
197
  [training.logger]
198
+ @loggers = "spacy.ConsoleLogger.v1"
199
+ progress_bar = false
 
200
 
201
  [training.optimizer]
202
  @optimizers = "Adam.v1"
 
219
  sents_p = null
220
  sents_r = null
221
  sents_f = 0.02
222
+ lemma_acc = 0.5
223
+ ents_f = 0.16
224
  ents_p = 0.0
225
  ents_r = 0.0
226
  ents_per_type = null
227
+ speed = 0.0
228
 
229
  [pretraining]
230
 
231
  [initialize]
232
+ vocab_data = null
233
  vectors = ${paths.vectors}
234
  init_tok2vec = ${paths.init_tok2vec}
235
  before_init = null
fr_core_news_sm-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:75feba5127c0d927e9309741907265733b67b1856ac1273f3be3776a9ca70edf
3
- size 17084756
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec38039523b1b64535f1c7d11ce45e6629404a50fa083f193d5555d9f6ac1a30
3
+ size 17362258
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"fr",
3
  "name":"core_news_sm",
4
- "version":"3.1.0",
5
  "description":"French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"LGPL-LR",
10
- "spacy_version":">=3.1.0,<3.2.0",
11
- "spacy_git_version":"caba63b74",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
@@ -173,7 +173,6 @@
173
  "Gender=Fem|Number=Plur|POS=DET|PronType=Int",
174
  "POS=DET",
175
  "Gender=Masc|Number=Plur|POS=PRON",
176
- "POS=PART",
177
  "Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin",
178
  "Mood=Ind|POS=VERB|Person=3|VerbForm=Fin",
179
  "Number=Sing|POS=VERB|Tense=Past|VerbForm=Part|Voice=Pass",
@@ -213,7 +212,6 @@
213
  "Mood=Imp|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin",
214
  "Mood=Sub|Number=Plur|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin",
215
  "Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Imp|VerbForm=Fin",
216
- "Gender=Fem|POS=ADV",
217
  "Mood=Ind|Number=Sing|POS=AUX|Person=2|Tense=Imp|VerbForm=Fin",
218
  "Number=Plur|POS=VERB|Tense=Past|VerbForm=Part",
219
  "Gender=Fem|Number=Plur|POS=PROPN",
@@ -296,71 +294,69 @@
296
  ],
297
  "performance":{
298
  "token_acc":0.9989751998,
299
- "tag_acc":0.934480272,
300
- "pos_acc":0.9632224168,
301
- "morph_acc":0.952690167,
302
- "lemma_acc":0.9032722042,
303
- "dep_uas":0.8787913869,
304
- "dep_las":0.8382242287,
305
- "sents_p":0.8642857143,
306
- "sents_r":0.8902232487,
307
- "sents_f":0.8725961538,
308
- "speed":5408.6948387533,
309
  "morph_per_feat":{
310
  "Definite":{
311
- "p":0.9853907962,
312
- "r":0.9839533187,
313
- "f":0.9846715328
314
  },
315
  "Number":{
316
- "p":0.9923163418,
317
- "r":0.9812824314,
318
- "f":0.9867685427
319
  },
320
  "PronType":{
321
- "p":0.99356085,
322
- "r":0.98657289,
323
- "f":0.9900545396
324
  },
325
  "Gender":{
326
- "p":0.9726847034,
327
- "r":0.9661498708,
328
- "f":0.9694062743
329
  },
330
  "Mood":{
331
- "p":0.9670932358,
332
- "r":0.9396092362,
333
- "f":0.9531531532
334
  },
335
  "Person":{
336
- "p":0.9805950841,
337
- "r":0.9522613065,
338
- "f":0.9662205226
339
  },
340
  "Tense":{
341
- "p":0.9441571872,
342
- "r":0.9325842697,
343
- "f":0.9383350462
344
  },
345
  "VerbForm":{
346
- "p":0.9672818792,
347
- "r":0.9544701987,
348
- "f":0.9608333333
349
  },
350
  "NumType":{
351
- "p":0.9857651246,
352
- "r":0.9551724138,
353
- "f":0.9702276708
354
  },
355
  "Reflex":{
356
- "p":0.9777777778,
357
  "r":1.0,
358
- "f":0.9887640449
359
  },
360
  "Voice":{
361
- "p":0.8833333333,
362
- "r":0.9464285714,
363
- "f":0.9137931034
364
  },
365
  "Poss":{
366
  "p":1.0,
@@ -368,141 +364,146 @@
368
  "f":1.0
369
  },
370
  "Polarity":{
371
- "p":0.9882352941,
372
- "r":0.9882352941,
373
- "f":0.9882352941
374
  }
375
  },
 
 
 
 
 
376
  "dep_las_per_type":{
377
  "det":{
378
- "p":0.9746938776,
379
- "r":0.9636803874,
380
- "f":0.9691558442
381
  },
382
  "nsubj":{
383
- "p":0.8484848485,
384
- "r":0.8096385542,
385
- "f":0.8286066584
386
  },
387
  "aux:tense":{
388
- "p":0.9268292683,
389
- "r":0.912,
390
- "f":0.9193548387
391
  },
392
  "root":{
393
- "p":0.8392434988,
394
- "r":0.8616504854,
395
- "f":0.8502994012
396
  },
397
  "obj":{
398
- "p":0.8373493976,
399
- "r":0.824925816,
400
- "f":0.8310911809
401
  },
402
  "cc":{
403
- "p":0.871559633,
404
- "r":0.8755760369,
405
- "f":0.8735632184
406
  },
407
  "case":{
408
- "p":0.9602960969,
409
- "r":0.9720708447,
410
- "f":0.9661475965
411
  },
412
  "obl:mod":{
413
- "p":0.6355140187,
414
- "r":0.6071428571,
415
- "f":0.6210045662
416
  },
417
  "nmod":{
418
- "p":0.790368272,
419
- "r":0.8353293413,
420
- "f":0.8122270742
421
  },
422
  "conj":{
423
- "p":0.52,
424
- "r":0.5118110236,
425
- "f":0.5158730159
426
  },
427
  "nummod":{
428
- "p":0.93125,
429
- "r":0.8869047619,
430
- "f":0.9085365854
431
  },
432
  "amod":{
433
- "p":0.8818181818,
434
- "r":0.8850364964,
435
- "f":0.883424408
436
  },
437
  "acl":{
438
- "p":0.6781609195,
439
- "r":0.6820809249,
440
- "f":0.6801152738
441
  },
442
  "mark":{
443
- "p":0.8755760369,
444
- "r":0.8370044053,
445
- "f":0.8558558559
446
  },
447
  "xcomp":{
448
- "p":0.8285714286,
449
- "r":0.7682119205,
450
- "f":0.7972508591
451
  },
452
  "flat:name":{
453
- "p":0.9223300971,
454
  "r":0.9047619048,
455
- "f":0.9134615385
456
  },
457
  "cop":{
458
- "p":0.808988764,
459
  "r":0.8,
460
- "f":0.8044692737
461
  },
462
  "advmod":{
463
- "p":0.8121019108,
464
- "r":0.7993730408,
465
- "f":0.8056872038
466
  },
467
  "obl:arg":{
468
- "p":0.6714975845,
469
- "r":0.6318181818,
470
- "f":0.6510538642
471
  },
472
  "appos":{
473
- "p":0.4831460674,
474
- "r":0.5180722892,
475
- "f":0.5
476
  },
477
  "nsubj:pass":{
478
- "p":0.8295454545,
479
- "r":0.8588235294,
480
- "f":0.8439306358
481
  },
482
  "aux:pass":{
483
- "p":0.905982906,
484
  "r":0.9464285714,
485
- "f":0.9257641921
486
  },
487
  "acl:relcl":{
488
- "p":0.6506024096,
489
- "r":0.6279069767,
490
- "f":0.6390532544
491
  },
492
  "advcl":{
493
- "p":0.4736842105,
494
- "r":0.4615384615,
495
- "f":0.4675324675
496
  },
497
  "fixed":{
498
- "p":0.8351648352,
499
- "r":0.7524752475,
500
- "f":0.7916666667
501
  },
502
  "dep":{
503
- "p":0.3111111111,
504
- "r":0.4516129032,
505
- "f":0.3684210526
506
  },
507
  "expl:subj":{
508
  "p":0.7058823529,
@@ -510,34 +511,34 @@
510
  "f":0.7272727273
511
  },
512
  "expl:comp":{
513
- "p":0.725,
514
- "r":0.9666666667,
515
- "f":0.8285714286
516
  },
517
  "expl:pass":{
518
- "p":0.6,
519
- "r":0.4285714286,
520
- "f":0.5
 
 
 
 
 
521
  },
522
  "ccomp":{
523
- "p":0.6296296296,
524
- "r":0.6666666667,
525
- "f":0.6476190476
526
  },
527
  "parataxis":{
528
- "p":0.4333333333,
529
- "r":0.4642857143,
530
- "f":0.4482758621
531
  },
532
  "iobj":{
533
- "p":0.7333333333,
534
- "r":0.44,
535
- "f":0.55
536
- },
537
- "obl:agent":{
538
- "p":0.9459459459,
539
- "r":0.8333333333,
540
- "f":0.8860759494
541
  },
542
  "nsubj:caus":{
543
  "p":0.0,
@@ -560,9 +561,9 @@
560
  "f":0.0
561
  },
562
  "vocative":{
563
- "p":0.8333333333,
564
  "r":0.625,
565
- "f":0.7142857143
566
  },
567
  "dislocated":{
568
  "p":0.0,
@@ -570,9 +571,9 @@
570
  "f":0.0
571
  },
572
  "flat:foreign":{
573
- "p":1.0,
574
- "r":0.1428571429,
575
- "f":0.25
576
  },
577
  "orphan":{
578
  "p":0.0,
@@ -590,35 +591,38 @@
590
  "f":0.0
591
  }
592
  },
593
- "ents_p":0.8141885091,
594
- "ents_r":0.8094952164,
595
- "ents_f":0.8118350797,
 
 
596
  "ents_per_type":{
597
  "PER":{
598
- "p":0.870535084,
599
- "r":0.883570968,
600
- "f":0.877004587
601
  },
602
  "LOC":{
603
- "p":0.8215006799,
604
- "r":0.8339747344,
605
- "f":0.8276907109
606
  },
607
  "ORG":{
608
- "p":0.7624722278,
609
- "r":0.7204198473,
610
- "f":0.7408497694
611
  },
612
  "MISC":{
613
- "p":0.6997455471,
614
- "r":0.641492929,
615
- "f":0.6693542253
616
  }
617
- }
 
618
  },
619
  "sources":[
620
  {
621
- "name":"UD French Sequoia v2.5",
622
  "url":"https://github.com/UniversalDependencies/UD_French-Sequoia",
623
  "license":"LGPL-LR",
624
  "author":"Candito, Marie; Seddah, Djam\u00e9; Perrier, Guy; Guillaume, Bruno"
 
1
  {
2
  "lang":"fr",
3
  "name":"core_news_sm",
4
+ "version":"3.2.0",
5
  "description":"French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"LGPL-LR",
10
+ "spacy_version":">=3.2.0,<3.3.0",
11
+ "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
 
173
  "Gender=Fem|Number=Plur|POS=DET|PronType=Int",
174
  "POS=DET",
175
  "Gender=Masc|Number=Plur|POS=PRON",
 
176
  "Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin",
177
  "Mood=Ind|POS=VERB|Person=3|VerbForm=Fin",
178
  "Number=Sing|POS=VERB|Tense=Past|VerbForm=Part|Voice=Pass",
 
212
  "Mood=Imp|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin",
213
  "Mood=Sub|Number=Plur|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin",
214
  "Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Imp|VerbForm=Fin",
 
215
  "Mood=Ind|Number=Sing|POS=AUX|Person=2|Tense=Imp|VerbForm=Fin",
216
  "Number=Plur|POS=VERB|Tense=Past|VerbForm=Part",
217
  "Gender=Fem|Number=Plur|POS=PROPN",
 
294
  ],
295
  "performance":{
296
  "token_acc":0.9989751998,
297
+ "token_p":0.9844389844,
298
+ "token_r":0.9896058454,
299
+ "token_f":0.9870156531,
300
+ "pos_acc":0.9600618397,
301
+ "morph_acc":0.9492783505,
302
+ "morph_micro_p":0.9765677992,
303
+ "morph_micro_r":0.9648470818,
304
+ "morph_micro_f":0.9706720603,
 
 
305
  "morph_per_feat":{
306
  "Definite":{
307
+ "p":0.986100951,
308
+ "r":0.9839416058,
309
+ "f":0.985020095
310
  },
311
  "Number":{
312
+ "p":0.9906838085,
313
+ "r":0.9788291605,
314
+ "f":0.9847208075
315
  },
316
  "PronType":{
317
+ "p":0.992916935,
318
+ "r":0.9865642994,
319
+ "f":0.9897304236
320
  },
321
  "Gender":{
322
+ "p":0.9705502454,
323
+ "r":0.9601328904,
324
+ "f":0.9653134635
325
  },
326
  "Mood":{
327
+ "p":0.9575645756,
328
+ "r":0.9218472469,
329
+ "f":0.9393665158
330
  },
331
  "Person":{
332
+ "p":0.9726205997,
333
+ "r":0.9383647799,
334
+ "f":0.9551856594
335
  },
336
  "Tense":{
337
+ "p":0.936918304,
338
+ "r":0.9254341164,
339
+ "f":0.9311408016
340
  },
341
  "VerbForm":{
342
+ "p":0.9538977368,
343
+ "r":0.9420529801,
344
+ "f":0.947938359
345
  },
346
  "NumType":{
347
+ "p":0.9858156028,
348
+ "r":0.9488054608,
349
+ "f":0.9669565217
350
  },
351
  "Reflex":{
352
+ "p":0.9565217391,
353
  "r":1.0,
354
+ "f":0.9777777778
355
  },
356
  "Voice":{
357
+ "p":0.8429752066,
358
+ "r":0.9107142857,
359
+ "f":0.8755364807
360
  },
361
  "Poss":{
362
  "p":1.0,
 
364
  "f":1.0
365
  },
366
  "Polarity":{
367
+ "p":0.9880952381,
368
+ "r":0.9764705882,
369
+ "f":0.9822485207
370
  }
371
  },
372
+ "sents_p":0.8658823529,
373
+ "sents_r":0.8932038835,
374
+ "sents_f":0.8793309438,
375
+ "dep_uas":0.8770041095,
376
+ "dep_las":0.832561907,
377
  "dep_las_per_type":{
378
  "det":{
379
+ "p":0.9724919094,
380
+ "r":0.9701372074,
381
+ "f":0.9713131313
382
  },
383
  "nsubj":{
384
+ "p":0.8618925831,
385
+ "r":0.8120481928,
386
+ "f":0.8362282878
387
  },
388
  "aux:tense":{
389
+ "p":0.9206349206,
390
+ "r":0.928,
391
+ "f":0.9243027888
392
  },
393
  "root":{
394
+ "p":0.853427896,
395
+ "r":0.8762135922,
396
+ "f":0.8646706587
397
  },
398
  "obj":{
399
+ "p":0.8171091445,
400
+ "r":0.821958457,
401
+ "f":0.8195266272
402
  },
403
  "cc":{
404
+ "p":0.869955157,
405
+ "r":0.8940092166,
406
+ "f":0.8818181818
407
  },
408
  "case":{
409
+ "p":0.9600811908,
410
+ "r":0.9666212534,
411
+ "f":0.9633401222
412
  },
413
  "obl:mod":{
414
+ "p":0.6214511041,
415
+ "r":0.5880597015,
416
+ "f":0.6042944785
417
  },
418
  "nmod":{
419
+ "p":0.7838095238,
420
+ "r":0.8221778222,
421
+ "f":0.8025353486
422
  },
423
  "conj":{
424
+ "p":0.5307692308,
425
+ "r":0.5433070866,
426
+ "f":0.5369649805
427
  },
428
  "nummod":{
429
+ "p":0.9210526316,
430
+ "r":0.8284023669,
431
+ "f":0.8722741433
432
  },
433
  "amod":{
434
+ "p":0.8683729433,
435
+ "r":0.8652094718,
436
+ "f":0.8667883212
437
  },
438
  "acl":{
439
+ "p":0.6411764706,
440
+ "r":0.6300578035,
441
+ "f":0.6355685131
442
  },
443
  "mark":{
444
+ "p":0.9052132701,
445
+ "r":0.8414096916,
446
+ "f":0.8721461187
447
  },
448
  "xcomp":{
449
+ "p":0.8,
450
+ "r":0.7947019868,
451
+ "f":0.7973421927
452
  },
453
  "flat:name":{
454
+ "p":0.8482142857,
455
  "r":0.9047619048,
456
+ "f":0.8755760369
457
  },
458
  "cop":{
459
+ "p":0.8571428571,
460
  "r":0.8,
461
+ "f":0.8275862069
462
  },
463
  "advmod":{
464
+ "p":0.8338658147,
465
+ "r":0.8181818182,
466
+ "f":0.8259493671
467
  },
468
  "obl:arg":{
469
+ "p":0.6553398058,
470
+ "r":0.6136363636,
471
+ "f":0.6338028169
472
  },
473
  "appos":{
474
+ "p":0.417721519,
475
+ "r":0.3975903614,
476
+ "f":0.4074074074
477
  },
478
  "nsubj:pass":{
479
+ "p":0.7717391304,
480
+ "r":0.8352941176,
481
+ "f":0.802259887
482
  },
483
  "aux:pass":{
484
+ "p":0.9137931034,
485
  "r":0.9464285714,
486
+ "f":0.9298245614
487
  },
488
  "acl:relcl":{
489
+ "p":0.5714285714,
490
+ "r":0.6046511628,
491
+ "f":0.5875706215
492
  },
493
  "advcl":{
494
+ "p":0.4929577465,
495
+ "r":0.4487179487,
496
+ "f":0.4697986577
497
  },
498
  "fixed":{
499
+ "p":0.691588785,
500
+ "r":0.74,
501
+ "f":0.7149758454
502
  },
503
  "dep":{
504
+ "p":0.2884615385,
505
+ "r":0.5172413793,
506
+ "f":0.3703703704
507
  },
508
  "expl:subj":{
509
  "p":0.7058823529,
 
511
  "f":0.7272727273
512
  },
513
  "expl:comp":{
514
+ "p":0.7428571429,
515
+ "r":0.8666666667,
516
+ "f":0.8
517
  },
518
  "expl:pass":{
519
+ "p":0.4,
520
+ "r":0.2857142857,
521
+ "f":0.3333333333
522
+ },
523
+ "obl:agent":{
524
+ "p":0.8205128205,
525
+ "r":0.7619047619,
526
+ "f":0.7901234568
527
  },
528
  "ccomp":{
529
+ "p":0.6603773585,
530
+ "r":0.6862745098,
531
+ "f":0.6730769231
532
  },
533
  "parataxis":{
534
+ "p":0.36,
535
+ "r":0.3214285714,
536
+ "f":0.3396226415
537
  },
538
  "iobj":{
539
+ "p":0.7,
540
+ "r":0.56,
541
+ "f":0.6222222222
 
 
 
 
 
542
  },
543
  "nsubj:caus":{
544
  "p":0.0,
 
561
  "f":0.0
562
  },
563
  "vocative":{
564
+ "p":1.0,
565
  "r":0.625,
566
+ "f":0.7692307692
567
  },
568
  "dislocated":{
569
  "p":0.0,
 
571
  "f":0.0
572
  },
573
  "flat:foreign":{
574
+ "p":0.0,
575
+ "r":0.0,
576
+ "f":0.0
577
  },
578
  "orphan":{
579
  "p":0.0,
 
591
  "f":0.0
592
  }
593
  },
594
+ "tag_acc":0.9312032981,
595
+ "lemma_acc":0.9031031648,
596
+ "ents_p":0.8121504727,
597
+ "ents_r":0.8080541211,
598
+ "ents_f":0.8100971185,
599
  "ents_per_type":{
600
  "PER":{
601
+ "p":0.8685030449,
602
+ "r":0.8787705094,
603
+ "f":0.8736066099
604
  },
605
  "LOC":{
606
+ "p":0.8245838668,
607
+ "r":0.835104158,
608
+ "f":0.8298106698
609
  },
610
  "ORG":{
611
+ "p":0.7541699762,
612
+ "r":0.7248091603,
613
+ "f":0.7391981316
614
  },
615
  "MISC":{
616
+ "p":0.6852231509,
617
+ "r":0.6334742674,
618
+ "f":0.6583333333
619
  }
620
+ },
621
+ "speed":4222.2093213177
622
  },
623
  "sources":[
624
  {
625
+ "name":"UD French Sequoia v2.8",
626
  "url":"https://github.com/UniversalDependencies/UD_French-Sequoia",
627
  "license":"LGPL-LR",
628
  "author":"Candito, Marie; Seddah, Djam\u00e9; Perrier, Guy; Guillaume, Bruno"
morphologizer/cfg CHANGED
@@ -1,4 +1,5 @@
1
  {
 
2
  "labels_morph":{
3
  "POS=PROPN":"",
4
  "Gender=Fem|Number=Sing|POS=DET|PronType=Dem":"Gender=Fem|Number=Sing|PronType=Dem",
@@ -153,7 +154,6 @@
153
  "Gender=Fem|Number=Plur|POS=DET|PronType=Int":"Gender=Fem|Number=Plur|PronType=Int",
154
  "POS=DET":"",
155
  "Gender=Masc|Number=Plur|POS=PRON":"Gender=Masc|Number=Plur",
156
- "POS=PART":"",
157
  "Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin":"Mood=Sub|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin",
158
  "Mood=Ind|POS=VERB|Person=3|VerbForm=Fin":"Mood=Ind|Person=3|VerbForm=Fin",
159
  "Number=Sing|POS=VERB|Tense=Past|VerbForm=Part|Voice=Pass":"Number=Sing|Tense=Past|VerbForm=Part|Voice=Pass",
@@ -193,7 +193,6 @@
193
  "Mood=Imp|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin":"Mood=Imp|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin",
194
  "Mood=Sub|Number=Plur|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin":"Mood=Sub|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin",
195
  "Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Imp|VerbForm=Fin":"Mood=Ind|Number=Plur|Person=2|Tense=Imp|VerbForm=Fin",
196
- "Gender=Fem|POS=ADV":"Gender=Fem",
197
  "Mood=Ind|Number=Sing|POS=AUX|Person=2|Tense=Imp|VerbForm=Fin":"Mood=Ind|Number=Sing|Person=2|Tense=Imp|VerbForm=Fin",
198
  "Number=Plur|POS=VERB|Tense=Past|VerbForm=Part":"Number=Plur|Tense=Past|VerbForm=Part",
199
  "Gender=Fem|Number=Plur|POS=PROPN":"Gender=Fem|Number=Plur",
@@ -353,7 +352,6 @@
353
  "Gender=Fem|Number=Plur|POS=DET|PronType=Int":90,
354
  "POS=DET":90,
355
  "Gender=Masc|Number=Plur|POS=PRON":95,
356
- "POS=PART":94,
357
  "Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin":87,
358
  "Mood=Ind|POS=VERB|Person=3|VerbForm=Fin":100,
359
  "Number=Sing|POS=VERB|Tense=Past|VerbForm=Part|Voice=Pass":100,
@@ -393,10 +391,10 @@
393
  "Mood=Imp|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin":100,
394
  "Mood=Sub|Number=Plur|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin":87,
395
  "Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Imp|VerbForm=Fin":100,
396
- "Gender=Fem|POS=ADV":86,
397
  "Mood=Ind|Number=Sing|POS=AUX|Person=2|Tense=Imp|VerbForm=Fin":87,
398
  "Number=Plur|POS=VERB|Tense=Past|VerbForm=Part":100,
399
  "Gender=Fem|Number=Plur|POS=PROPN":96,
400
  "Gender=Masc|NumType=Card|POS=NUM":93
401
- }
 
402
  }
 
1
  {
2
+ "extend":false,
3
  "labels_morph":{
4
  "POS=PROPN":"",
5
  "Gender=Fem|Number=Sing|POS=DET|PronType=Dem":"Gender=Fem|Number=Sing|PronType=Dem",
 
154
  "Gender=Fem|Number=Plur|POS=DET|PronType=Int":"Gender=Fem|Number=Plur|PronType=Int",
155
  "POS=DET":"",
156
  "Gender=Masc|Number=Plur|POS=PRON":"Gender=Masc|Number=Plur",
 
157
  "Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin":"Mood=Sub|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin",
158
  "Mood=Ind|POS=VERB|Person=3|VerbForm=Fin":"Mood=Ind|Person=3|VerbForm=Fin",
159
  "Number=Sing|POS=VERB|Tense=Past|VerbForm=Part|Voice=Pass":"Number=Sing|Tense=Past|VerbForm=Part|Voice=Pass",
 
193
  "Mood=Imp|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin":"Mood=Imp|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin",
194
  "Mood=Sub|Number=Plur|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin":"Mood=Sub|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin",
195
  "Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Imp|VerbForm=Fin":"Mood=Ind|Number=Plur|Person=2|Tense=Imp|VerbForm=Fin",
 
196
  "Mood=Ind|Number=Sing|POS=AUX|Person=2|Tense=Imp|VerbForm=Fin":"Mood=Ind|Number=Sing|Person=2|Tense=Imp|VerbForm=Fin",
197
  "Number=Plur|POS=VERB|Tense=Past|VerbForm=Part":"Number=Plur|Tense=Past|VerbForm=Part",
198
  "Gender=Fem|Number=Plur|POS=PROPN":"Gender=Fem|Number=Plur",
 
352
  "Gender=Fem|Number=Plur|POS=DET|PronType=Int":90,
353
  "POS=DET":90,
354
  "Gender=Masc|Number=Plur|POS=PRON":95,
 
355
  "Mood=Sub|Number=Plur|POS=AUX|Person=3|Tense=Pres|VerbForm=Fin":87,
356
  "Mood=Ind|POS=VERB|Person=3|VerbForm=Fin":100,
357
  "Number=Sing|POS=VERB|Tense=Past|VerbForm=Part|Voice=Pass":100,
 
391
  "Mood=Imp|Number=Plur|POS=VERB|Person=1|Tense=Pres|VerbForm=Fin":100,
392
  "Mood=Sub|Number=Plur|POS=AUX|Person=2|Tense=Pres|VerbForm=Fin":87,
393
  "Mood=Ind|Number=Plur|POS=VERB|Person=2|Tense=Imp|VerbForm=Fin":100,
 
394
  "Mood=Ind|Number=Sing|POS=AUX|Person=2|Tense=Imp|VerbForm=Fin":87,
395
  "Number=Plur|POS=VERB|Tense=Past|VerbForm=Part":100,
396
  "Gender=Fem|Number=Plur|POS=PROPN":96,
397
  "Gender=Masc|NumType=Card|POS=NUM":93
398
+ },
399
+ "overwrite":true
400
  }
morphologizer/model CHANGED
Binary files a/morphologizer/model and b/morphologizer/model differ
 
ner/model CHANGED
Binary files a/ner/model and b/ner/model differ
 
parser/model CHANGED
Binary files a/parser/model and b/parser/model differ
 
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves��{"0":{"":25247},"1":{"":21688},"2":{"case":7258,"det":6062,"nsubj":1972,"punct":1645,"advmod":1210,"cc":1205,"mark":1051,"aux:tense":673,"amod":662,"nummod":595,"aux:pass":544,"obl:mod":483,"nsubj:pass":425,"cop":365,"expl:comp":204,"obj":170,"expl:subj":163,"iobj":139,"advcl":123,"nmod":92,"expl:pass":40,"vocative":35,"dep":0},"3":{"nmod":5132,"punct":3954,"amod":2083,"conj":1517,"obj":1410,"obl:mod":1184,"obl:arg":1078,"acl":782,"xcomp":739,"flat:name":657,"advmod":562,"fixed":418,"appos":408,"acl:relcl":365,"advcl":306,"ccomp":238,"obl:agent":206,"dep":138,"nummod":117,"parataxis":92,"nsubj":75,"flat:foreign":63},"4":{"ROOT":2219}}�cfg��neg_key�
 
1
+ ��moves��{"0":{"":25255},"1":{"":21680},"2":{"case":7258,"det":6062,"nsubj":1982,"punct":1645,"advmod":1210,"cc":1205,"mark":1051,"aux:tense":673,"amod":662,"nummod":595,"aux:pass":544,"obl:mod":483,"nsubj:pass":425,"cop":365,"expl:comp":204,"obj":170,"expl:subj":164,"iobj":139,"advcl":123,"nmod":92,"expl:pass":40,"vocative":35,"dep":0},"3":{"nmod":5132,"punct":3954,"amod":2083,"conj":1517,"obj":1410,"obl:mod":1184,"obl:arg":1078,"acl":782,"xcomp":739,"flat:name":657,"advmod":562,"fixed":409,"appos":408,"acl:relcl":365,"advcl":306,"ccomp":238,"obl:agent":206,"dep":138,"nummod":117,"parataxis":92,"nsubj":75,"flat:foreign":63},"4":{"ROOT":2219}}�cfg��neg_key�
senter/cfg CHANGED
@@ -1,3 +1,3 @@
1
  {
2
-
3
  }
 
1
  {
2
+ "overwrite":false
3
  }
senter/model CHANGED
Binary files a/senter/model and b/senter/model differ
 
tok2vec/model CHANGED
Binary files a/tok2vec/model and b/tok2vec/model differ
 
tokenizer CHANGED
The diff for this file is too large to render. See raw diff
 
vocab/strings.json CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ea0891af9c9cc0e425da394f3d2a2acde1213c29fe47cd44e47a97ea2a99471c
3
- size 2230846
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce0853933f5521cf4e1c89fa49d3d7b431fe1baa58f2b0d436be051a0c235a1b
3
+ size 2240144
vocab/vectors.cfg ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "mode":"default"
3
+ }