EC2 Default User commited on
Commit
4887ea4
1 Parent(s): 8a8eb44

Update spaCy pipeline

Browse files
.gitattributes CHANGED
@@ -19,3 +19,5 @@
19
  *strings.json filter=lfs diff=lfs merge=lfs -text
20
  vectors filter=lfs diff=lfs merge=lfs -text
21
  model filter=lfs diff=lfs merge=lfs -text
 
 
19
  *strings.json filter=lfs diff=lfs merge=lfs -text
20
  vectors filter=lfs diff=lfs merge=lfs -text
21
  model filter=lfs diff=lfs merge=lfs -text
22
+ *key2row filter=lfs diff=lfs merge=lfs -text
23
+ *tokenizer filter=lfs diff=lfs merge=lfs -text
LICENSES_SOURCES CHANGED
@@ -105,6 +105,8 @@ END OF TERMS AND CONDITIONS```
105
  * License: CC BY 4.0
106
 
107
  ```
 
 
108
  By exercising the Licensed Rights (defined below), You accept and agree to be bound by the terms and conditions of this Creative Commons Attribution 4.0 International Public License ("Public License"). To the extent this Public License may be interpreted as a contract, You are granted the Licensed Rights in consideration of Your acceptance of these terms and conditions, and the Licensor grants You such rights in consideration of benefits the Licensor receives from making the Licensed Material available under these terms and conditions.
109
 
110
  Section 1 – Definitions.
105
  * License: CC BY 4.0
106
 
107
  ```
108
+ Creative Commons Attribution 4.0 International Public License
109
+
110
  By exercising the Licensed Rights (defined below), You accept and agree to be bound by the terms and conditions of this Creative Commons Attribution 4.0 International Public License ("Public License"). To the extent this Public License may be interpreted as a contract, You are granted the Licensed Rights in consideration of Your acceptance of these terms and conditions, and the Licensor grants You such rights in consideration of benefits the Licensor receives from making the Licensed Material available under these terms and conditions.
111
 
112
  Section 1 – Definitions.
README.md CHANGED
@@ -14,47 +14,62 @@ model-index:
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
- value: 0.8121504727
18
  - name: NER Recall
19
  type: recall
20
- value: 0.8080541211
21
  - name: NER F Score
22
  type: f_score
23
- value: 0.8100971185
 
 
 
 
 
 
 
24
  - task:
25
  name: POS
26
  type: token-classification
27
  metrics:
28
- - name: POS Accuracy
29
  type: accuracy
30
- value: 0.9312032981
31
  - task:
32
- name: SENTER
33
  type: token-classification
34
  metrics:
35
- - name: SENTER Precision
36
- type: precision
37
- value: 0.8658823529
38
- - name: SENTER Recall
39
- type: recall
40
- value: 0.8932038835
41
- - name: SENTER F Score
42
- type: f_score
43
- value: 0.8793309438
44
  - task:
45
- name: UNLABELED_DEPENDENCIES
46
  type: token-classification
47
  metrics:
48
- - name: Unlabeled Dependencies Accuracy
49
  type: accuracy
50
- value: 0.8770041095
 
 
 
 
 
 
 
51
  - task:
52
  name: LABELED_DEPENDENCIES
53
  type: token-classification
54
  metrics:
55
- - name: Labeled Dependencies Accuracy
56
- type: accuracy
57
- value: 0.8770041095
 
 
 
 
 
 
 
58
  ---
59
  ### Details: https://spacy.io/models/fr#fr_core_news_sm
60
 
@@ -63,8 +78,8 @@ French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, s
63
  | Feature | Description |
64
  | --- | --- |
65
  | **Name** | `fr_core_news_sm` |
66
- | **Version** | `3.2.0` |
67
- | **spaCy** | `>=3.2.0,<3.3.0` |
68
  | **Default Pipeline** | `tok2vec`, `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
69
  | **Components** | `tok2vec`, `morphologizer`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
70
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
@@ -76,13 +91,12 @@ French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, s
76
 
77
  <details>
78
 
79
- <summary>View label scheme (238 labels for 4 components)</summary>
80
 
81
  | Component | Labels |
82
  | --- | --- |
83
  | **`morphologizer`** | `POS=PROPN`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Dem`, `Gender=Fem\|Number=Sing\|POS=NOUN`, `Number=Plur\|POS=PRON\|Person=1`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=SCONJ`, `POS=ADP`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Ord\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=NOUN`, `POS=PUNCT`, `Gender=Masc\|Number=Sing\|POS=PROPN`, `Number=Plur\|POS=ADJ`, `Gender=Masc\|Number=Plur\|POS=NOUN`, `Definite=Ind\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Number=Sing\|POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `POS=ADV`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=PROPN`, `Definite=Def\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Card\|POS=NUM`, `Definite=Def\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=ADJ`, `POS=CCONJ`, `Gender=Fem\|Number=Plur\|POS=NOUN`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=ADJ`, `POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `POS=PRON\|PronType=Rel`, `Number=Sing\|POS=DET\|Poss=Yes`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Number=Plur\|POS=ADP\|PronType=Art`, `Definite=Ind\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=VERB\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3`, `Number=Plur\|POS=DET`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Dem`, `POS=ADV\|PronType=Int`, `POS=VERB\|Tense=Pres\|VerbForm=Part`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Number=Plur\|POS=DET\|Poss=Yes`, `POS=AUX\|VerbForm=Inf`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Masc\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=ADV\|Polarity=Neg`, `Definite=Ind\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3`, `POS=PRON\|Person=3\|Reflex=Yes`, `Gender=Masc\|POS=NOUN`, `POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=PRON\|Person=3`, `Number=Plur\|POS=NOUN`, `NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=AUX\|Tense=Pres\|VerbForm=Part`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|Person=3`, `Number=Sing\|POS=NOUN`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Plur\|POS=PROPN`, `Number=Sing\|POS=PROPN`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Dem`, `Gender=Masc\|Number=Sing\|POS=DET`, `Gender=Fem\|Number=Sing\|POS=DET\|Poss=Yes`, `Gender=Masc\|POS=PRON`, `POS=NOUN`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON`, `Gender=Masc\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Number=Sing\|POS=PRON`, `Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Ind\|POS=VERB\|VerbForm=Fin`, `Number=Plur\|POS=DET\|PronType=Dem`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=PRON`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Dem`, `Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=PRON`, `POS=NUM`, `Gender=Fem\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=PRON`, `Number=Plur\|POS=PRON\|Person=3`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|Person=1`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=INTJ`, `Number=Plur\|POS=PRON\|Person=2`, `NumType=Card\|POS=PRON`, `Definite=Ind\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `NumType=Card\|POS=NOUN`, `POS=PRON\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3`, `Gender=Fem\|Number=Sing\|POS=DET`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Ind\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Dem`, `Gender=Masc\|Number=Plur\|POS=PROPN`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Dem`, `Number=Sing\|POS=DET`, `Gender=Masc\|NumType=Card\|Number=Plur\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Dem`, `Mood=Ind\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|POS=PRON`, `Gender=Masc\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=X`, `POS=SYM`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Int`, `POS=DET`, `Gender=Masc\|Number=Plur\|POS=PRON`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Person=3\|VerbForm=Fin`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Masc\|Number=Plur\|POS=DET`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Rel`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Rel`, `POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Fem\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Imp\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|Reflex=Yes`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=1\|Reflex=Yes`, `Gender=Masc\|NumType=Card\|Number=Sing\|POS=NOUN`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Number=Sing\|POS=PRON\|Person=1\|Reflex=Yes`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|POS=PROPN`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=PROPN`, `Gender=Masc\|NumType=Card\|POS=NUM` |
84
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux:pass`, `aux:tense`, `case`, `cc`, `ccomp`, `conj`, `cop`, `dep`, `det`, `expl:comp`, `expl:pass`, `expl:subj`, `fixed`, `flat:foreign`, `flat:name`, `iobj`, `mark`, `nmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl:agent`, `obl:arg`, `obl:mod`, `parataxis`, `punct`, `vocative`, `xcomp` |
85
- | **`senter`** | `I`, `S` |
86
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |
87
 
88
  </details>
@@ -95,18 +109,18 @@ French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, s
95
  | `TOKEN_P` | 98.44 |
96
  | `TOKEN_R` | 98.96 |
97
  | `TOKEN_F` | 98.70 |
98
- | `POS_ACC` | 96.01 |
99
- | `MORPH_ACC` | 94.93 |
100
- | `MORPH_MICRO_P` | 97.66 |
101
- | `MORPH_MICRO_R` | 96.48 |
102
- | `MORPH_MICRO_F` | 97.07 |
103
- | `SENTS_P` | 86.59 |
104
- | `SENTS_R` | 89.32 |
105
- | `SENTS_F` | 87.93 |
106
- | `DEP_UAS` | 87.70 |
107
- | `DEP_LAS` | 83.26 |
108
- | `TAG_ACC` | 93.12 |
109
- | `LEMMA_ACC` | 90.31 |
110
- | `ENTS_P` | 81.22 |
111
- | `ENTS_R` | 80.81 |
112
- | `ENTS_F` | 81.01 |
14
  metrics:
15
  - name: NER Precision
16
  type: precision
17
+ value: 0.8116540445
18
  - name: NER Recall
19
  type: recall
20
+ value: 0.8065529803
21
  - name: NER F Score
22
  type: f_score
23
+ value: 0.8090954723
24
+ - task:
25
+ name: TAG
26
+ type: token-classification
27
+ metrics:
28
+ - name: TAG (XPOS) Accuracy
29
+ type: accuracy
30
+ value: 0.9330104092
31
  - task:
32
  name: POS
33
  type: token-classification
34
  metrics:
35
+ - name: POS (UPOS) Accuracy
36
  type: accuracy
37
+ value: 0.9619705246
38
  - task:
39
+ name: MORPH
40
  type: token-classification
41
  metrics:
42
+ - name: Morph (UFeats) Accuracy
43
+ type: accuracy
44
+ value: 0.9527981037
 
 
 
 
 
 
45
  - task:
46
+ name: LEMMA
47
  type: token-classification
48
  metrics:
49
+ - name: Lemma Accuracy
50
  type: accuracy
51
+ value: 0.9032059186
52
+ - task:
53
+ name: UNLABELED_DEPENDENCIES
54
+ type: token-classification
55
+ metrics:
56
+ - name: Unlabeled Attachment Score (UAS)
57
+ type: f_score
58
+ value: 0.8747540225
59
  - task:
60
  name: LABELED_DEPENDENCIES
61
  type: token-classification
62
  metrics:
63
+ - name: Labeled Attachment Score (LAS)
64
+ type: f_score
65
+ value: 0.8314723749
66
+ - task:
67
+ name: SENTS
68
+ type: token-classification
69
+ metrics:
70
+ - name: Sentences F-Score
71
+ type: f_score
72
+ value: 0.8829915561
73
  ---
74
  ### Details: https://spacy.io/models/fr#fr_core_news_sm
75
 
78
  | Feature | Description |
79
  | --- | --- |
80
  | **Name** | `fr_core_news_sm` |
81
+ | **Version** | `3.3.0` |
82
+ | **spaCy** | `>=3.3.0.dev0,<3.4.0` |
83
  | **Default Pipeline** | `tok2vec`, `morphologizer`, `parser`, `attribute_ruler`, `lemmatizer`, `ner` |
84
  | **Components** | `tok2vec`, `morphologizer`, `parser`, `senter`, `attribute_ruler`, `lemmatizer`, `ner` |
85
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
91
 
92
  <details>
93
 
94
+ <summary>View label scheme (236 labels for 3 components)</summary>
95
 
96
  | Component | Labels |
97
  | --- | --- |
98
  | **`morphologizer`** | `POS=PROPN`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Dem`, `Gender=Fem\|Number=Sing\|POS=NOUN`, `Number=Plur\|POS=PRON\|Person=1`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=SCONJ`, `POS=ADP`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Ord\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=NOUN`, `POS=PUNCT`, `Gender=Masc\|Number=Sing\|POS=PROPN`, `Number=Plur\|POS=ADJ`, `Gender=Masc\|Number=Plur\|POS=NOUN`, `Definite=Ind\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Number=Sing\|POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `POS=ADV`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Definite=Def\|Gender=Fem\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=PROPN`, `Definite=Def\|Number=Sing\|POS=DET\|PronType=Art`, `NumType=Card\|POS=NUM`, `Definite=Def\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Masc\|Number=Plur\|POS=ADJ`, `POS=CCONJ`, `Gender=Fem\|Number=Plur\|POS=NOUN`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=ADJ`, `POS=ADJ`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `POS=PRON\|PronType=Rel`, `Number=Sing\|POS=DET\|Poss=Yes`, `Definite=Def\|Gender=Masc\|Number=Sing\|POS=ADP\|PronType=Art`, `Definite=Def\|Number=Plur\|POS=ADP\|PronType=Art`, `Definite=Ind\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=VERB\|VerbForm=Inf`, `Gender=Fem\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3`, `Number=Plur\|POS=DET`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=ADJ`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Dem`, `POS=ADV\|PronType=Int`, `POS=VERB\|Tense=Pres\|VerbForm=Part`, `Gender=Fem\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Definite=Ind\|Gender=Masc\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Masc\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Number=Plur\|POS=DET\|Poss=Yes`, `POS=AUX\|VerbForm=Inf`, `Gender=Masc\|Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Masc\|POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=ADV\|Polarity=Neg`, `Definite=Ind\|Number=Sing\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3`, `POS=PRON\|Person=3\|Reflex=Yes`, `Gender=Masc\|POS=NOUN`, `POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=PRON\|Person=3`, `Number=Plur\|POS=NOUN`, `NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=VERB\|Tense=Past\|VerbForm=Part`, `POS=AUX\|Tense=Pres\|VerbForm=Part`, `Gender=Masc\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|Person=3`, `Number=Sing\|POS=NOUN`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Gender=Fem\|NumType=Ord\|Number=Sing\|POS=ADJ`, `Number=Plur\|POS=PROPN`, `Number=Sing\|POS=PROPN`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Dem`, `Gender=Masc\|Number=Sing\|POS=DET`, `Gender=Fem\|Number=Sing\|POS=DET\|Poss=Yes`, `Gender=Masc\|POS=PRON`, `POS=NOUN`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON`, `Gender=Masc\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Number=Sing\|POS=PRON`, `Number=Sing\|POS=PRON\|PronType=Dem`, `Mood=Ind\|POS=VERB\|VerbForm=Fin`, `Number=Plur\|POS=DET\|PronType=Dem`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=PRON`, `Gender=Masc\|Number=Sing\|POS=PRON\|Person=3\|PronType=Dem`, `Number=Sing\|POS=PRON\|Person=2\|PronType=Prs`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=3\|Tense=Imp\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|NumType=Ord\|Number=Sing\|POS=ADJ`, `POS=PRON`, `POS=NUM`, `Gender=Fem\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=PRON`, `Number=Plur\|POS=PRON\|Person=3`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Number=Sing\|POS=PRON\|Person=1`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Past\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON`, `Gender=Fem\|Number=Sing\|POS=PRON\|Person=3\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `POS=INTJ`, `Number=Plur\|POS=PRON\|Person=2`, `NumType=Card\|POS=PRON`, `Definite=Ind\|Gender=Fem\|Number=Plur\|POS=DET\|PronType=Art`, `Gender=Fem\|Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `NumType=Card\|POS=NOUN`, `POS=PRON\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3`, `Gender=Fem\|Number=Sing\|POS=DET`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=DET`, `Mood=Sub\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Definite=Ind\|Gender=Masc\|Number=Plur\|POS=DET\|PronType=Art`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=PRON\|PronType=Dem`, `Gender=Masc\|Number=Plur\|POS=PROPN`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Dem`, `Number=Sing\|POS=DET`, `Gender=Masc\|NumType=Card\|Number=Plur\|POS=NOUN`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Dem`, `Mood=Ind\|POS=VERB\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|POS=PRON`, `Gender=Masc\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Fem\|Number=Sing\|POS=PRON\|PronType=Rel`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=AUX\|Tense=Past\|VerbForm=Part`, `POS=X`, `POS=SYM`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Fem\|Number=Plur\|POS=DET\|PronType=Int`, `POS=DET`, `Gender=Masc\|Number=Plur\|POS=PRON`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|POS=VERB\|Person=3\|VerbForm=Fin`, `Number=Sing\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Mood=Cnd\|Number=Plur\|POS=VERB\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Gender=Fem\|Number=Sing\|POS=DET\|PronType=Int`, `Gender=Masc\|Number=Plur\|POS=DET`, `Gender=Fem\|Number=Plur\|POS=PRON\|PronType=Rel`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Masc\|Number=Plur\|POS=PRON\|PronType=Rel`, `POS=VERB\|Tense=Past\|VerbForm=Part\|Voice=Pass`, `Gender=Fem\|NumType=Ord\|Number=Plur\|POS=ADJ`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Fut\|VerbForm=Fin`, `Mood=Imp\|POS=VERB\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|Reflex=Yes`, `Mood=Cnd\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=1\|Reflex=Yes`, `Gender=Masc\|NumType=Card\|Number=Sing\|POS=NOUN`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Number=Sing\|POS=PRON\|Person=1\|Reflex=Yes`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=AUX\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Imp\|VerbForm=Fin`, `Mood=Sub\|Number=Sing\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Gender=Masc\|POS=PROPN`, `Mood=Cnd\|Number=Plur\|POS=AUX\|Person=3\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Sub\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Number=Plur\|POS=PRON\|Person=2\|PronType=Prs`, `Mood=Ind\|Number=Sing\|POS=VERB\|Person=1\|Tense=Fut\|VerbForm=Fin`, `Gender=Fem\|Number=Plur\|POS=PRON\|Person=3\|PronType=Prs`, `Number=Sing\|POS=PRON\|Person=1\|PronType=Prs`, `Mood=Cnd\|Number=Sing\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Imp\|Number=Plur\|POS=VERB\|Person=1\|Tense=Pres\|VerbForm=Fin`, `Mood=Sub\|Number=Plur\|POS=AUX\|Person=2\|Tense=Pres\|VerbForm=Fin`, `Mood=Ind\|Number=Plur\|POS=VERB\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Mood=Ind\|Number=Sing\|POS=AUX\|Person=2\|Tense=Imp\|VerbForm=Fin`, `Number=Plur\|POS=VERB\|Tense=Past\|VerbForm=Part`, `Gender=Fem\|Number=Plur\|POS=PROPN`, `Gender=Masc\|NumType=Card\|POS=NUM` |
99
  | **`parser`** | `ROOT`, `acl`, `acl:relcl`, `advcl`, `advmod`, `amod`, `appos`, `aux:pass`, `aux:tense`, `case`, `cc`, `ccomp`, `conj`, `cop`, `dep`, `det`, `expl:comp`, `expl:pass`, `expl:subj`, `fixed`, `flat:foreign`, `flat:name`, `iobj`, `mark`, `nmod`, `nsubj`, `nsubj:pass`, `nummod`, `obj`, `obl:agent`, `obl:arg`, `obl:mod`, `parataxis`, `punct`, `vocative`, `xcomp` |
 
100
  | **`ner`** | `LOC`, `MISC`, `ORG`, `PER` |
101
 
102
  </details>
109
  | `TOKEN_P` | 98.44 |
110
  | `TOKEN_R` | 98.96 |
111
  | `TOKEN_F` | 98.70 |
112
+ | `POS_ACC` | 96.20 |
113
+ | `MORPH_ACC` | 95.28 |
114
+ | `MORPH_MICRO_P` | 97.82 |
115
+ | `MORPH_MICRO_R` | 96.83 |
116
+ | `MORPH_MICRO_F` | 97.32 |
117
+ | `SENTS_P` | 87.77 |
118
+ | `SENTS_R` | 88.83 |
119
+ | `SENTS_F` | 88.30 |
120
+ | `DEP_UAS` | 87.48 |
121
+ | `DEP_LAS` | 83.15 |
122
+ | `TAG_ACC` | 93.30 |
123
+ | `LEMMA_ACC` | 90.32 |
124
+ | `ENTS_P` | 81.17 |
125
+ | `ENTS_R` | 80.66 |
126
+ | `ENTS_F` | 80.91 |
accuracy.json CHANGED
@@ -3,66 +3,66 @@
3
  "token_p": 0.9844389844,
4
  "token_r": 0.9896058454,
5
  "token_f": 0.9870156531,
6
- "pos_acc": 0.9600618397,
7
- "morph_acc": 0.9492783505,
8
- "morph_micro_p": 0.9765677992,
9
- "morph_micro_r": 0.9648470818,
10
- "morph_micro_f": 0.9706720603,
11
  "morph_per_feat": {
12
  "Definite": {
13
- "p": 0.986100951,
14
- "r": 0.9839416058,
15
- "f": 0.985020095
16
  },
17
  "Number": {
18
- "p": 0.9906838085,
19
- "r": 0.9788291605,
20
- "f": 0.9847208075
21
  },
22
  "PronType": {
23
- "p": 0.992916935,
24
- "r": 0.9865642994,
25
- "f": 0.9897304236
26
  },
27
  "Gender": {
28
- "p": 0.9705502454,
29
- "r": 0.9601328904,
30
- "f": 0.9653134635
31
  },
32
  "Mood": {
33
- "p": 0.9575645756,
34
- "r": 0.9218472469,
35
- "f": 0.9393665158
36
  },
37
  "Person": {
38
- "p": 0.9726205997,
39
- "r": 0.9383647799,
40
- "f": 0.9551856594
41
  },
42
  "Tense": {
43
- "p": 0.936918304,
44
- "r": 0.9254341164,
45
- "f": 0.9311408016
46
  },
47
  "VerbForm": {
48
- "p": 0.9538977368,
49
- "r": 0.9420529801,
50
- "f": 0.947938359
51
  },
52
  "NumType": {
53
- "p": 0.9858156028,
54
- "r": 0.9488054608,
55
- "f": 0.9669565217
56
  },
57
  "Reflex": {
58
- "p": 0.9565217391,
59
  "r": 1.0,
60
- "f": 0.9777777778
61
  },
62
  "Voice": {
63
- "p": 0.8429752066,
64
- "r": 0.9107142857,
65
- "f": 0.8755364807
66
  },
67
  "Poss": {
68
  "p": 1.0,
@@ -70,181 +70,181 @@
70
  "f": 1.0
71
  },
72
  "Polarity": {
73
- "p": 0.9880952381,
74
- "r": 0.9764705882,
75
- "f": 0.9822485207
76
  }
77
  },
78
- "sents_p": 0.8658823529,
79
- "sents_r": 0.8932038835,
80
- "sents_f": 0.8793309438,
81
- "dep_uas": 0.8770041095,
82
- "dep_las": 0.832561907,
83
  "dep_las_per_type": {
84
  "det": {
85
- "p": 0.9724919094,
86
- "r": 0.9701372074,
87
- "f": 0.9713131313
88
  },
89
  "nsubj": {
90
- "p": 0.8618925831,
91
- "r": 0.8120481928,
92
- "f": 0.8362282878
93
  },
94
  "aux:tense": {
95
- "p": 0.9206349206,
96
- "r": 0.928,
97
- "f": 0.9243027888
98
  },
99
  "root": {
100
- "p": 0.853427896,
101
- "r": 0.8762135922,
102
- "f": 0.8646706587
103
  },
104
  "obj": {
105
- "p": 0.8171091445,
106
  "r": 0.821958457,
107
- "f": 0.8195266272
108
  },
109
  "cc": {
110
- "p": 0.869955157,
111
- "r": 0.8940092166,
112
- "f": 0.8818181818
113
  },
114
  "case": {
115
- "p": 0.9600811908,
116
- "r": 0.9666212534,
117
- "f": 0.9633401222
118
  },
119
  "obl:mod": {
120
- "p": 0.6214511041,
121
- "r": 0.5880597015,
122
- "f": 0.6042944785
123
  },
124
  "nmod": {
125
- "p": 0.7838095238,
126
- "r": 0.8221778222,
127
- "f": 0.8025353486
128
  },
129
  "conj": {
130
- "p": 0.5307692308,
131
- "r": 0.5433070866,
132
- "f": 0.5369649805
133
  },
134
  "nummod": {
135
- "p": 0.9210526316,
136
- "r": 0.8284023669,
137
- "f": 0.8722741433
138
  },
139
  "amod": {
140
- "p": 0.8683729433,
141
- "r": 0.8652094718,
142
- "f": 0.8667883212
143
  },
144
  "acl": {
145
- "p": 0.6411764706,
146
- "r": 0.6300578035,
147
- "f": 0.6355685131
148
  },
149
  "mark": {
150
- "p": 0.9052132701,
151
- "r": 0.8414096916,
152
- "f": 0.8721461187
153
  },
154
  "xcomp": {
155
- "p": 0.8,
156
- "r": 0.7947019868,
157
- "f": 0.7973421927
158
  },
159
  "flat:name": {
160
- "p": 0.8482142857,
161
- "r": 0.9047619048,
162
- "f": 0.8755760369
163
  },
164
  "cop": {
165
- "p": 0.8571428571,
166
- "r": 0.8,
167
- "f": 0.8275862069
168
  },
169
  "advmod": {
170
- "p": 0.8338658147,
171
- "r": 0.8181818182,
172
- "f": 0.8259493671
173
  },
174
  "obl:arg": {
175
- "p": 0.6553398058,
176
- "r": 0.6136363636,
177
- "f": 0.6338028169
178
  },
179
  "appos": {
180
- "p": 0.417721519,
181
- "r": 0.3975903614,
182
- "f": 0.4074074074
183
  },
184
  "nsubj:pass": {
185
- "p": 0.7717391304,
186
- "r": 0.8352941176,
187
- "f": 0.802259887
188
  },
189
  "aux:pass": {
190
- "p": 0.9137931034,
191
  "r": 0.9464285714,
192
- "f": 0.9298245614
193
  },
194
  "acl:relcl": {
195
- "p": 0.5714285714,
196
- "r": 0.6046511628,
197
- "f": 0.5875706215
198
  },
199
  "advcl": {
200
- "p": 0.4929577465,
201
- "r": 0.4487179487,
202
- "f": 0.4697986577
203
  },
204
  "fixed": {
205
- "p": 0.691588785,
206
  "r": 0.74,
207
- "f": 0.7149758454
208
  },
209
  "dep": {
210
- "p": 0.2884615385,
211
- "r": 0.5172413793,
212
- "f": 0.3703703704
213
  },
214
  "expl:subj": {
215
- "p": 0.7058823529,
216
- "r": 0.75,
217
- "f": 0.7272727273
218
  },
219
  "expl:comp": {
220
- "p": 0.7428571429,
221
- "r": 0.8666666667,
222
- "f": 0.8
223
  },
224
  "expl:pass": {
225
- "p": 0.4,
226
  "r": 0.2857142857,
227
- "f": 0.3333333333
228
  },
229
  "obl:agent": {
230
- "p": 0.8205128205,
231
  "r": 0.7619047619,
232
- "f": 0.7901234568
233
  },
234
  "ccomp": {
235
- "p": 0.6603773585,
236
- "r": 0.6862745098,
237
- "f": 0.6730769231
238
  },
239
  "parataxis": {
240
- "p": 0.36,
241
- "r": 0.3214285714,
242
- "f": 0.3396226415
243
  },
244
  "iobj": {
245
- "p": 0.7,
246
- "r": 0.56,
247
- "f": 0.6222222222
248
  },
249
  "nsubj:caus": {
250
  "p": 0.0,
@@ -267,9 +267,9 @@
267
  "f": 0.0
268
  },
269
  "vocative": {
270
- "p": 1.0,
271
  "r": 0.625,
272
- "f": 0.7692307692
273
  },
274
  "dislocated": {
275
  "p": 0.0,
@@ -277,9 +277,9 @@
277
  "f": 0.0
278
  },
279
  "flat:foreign": {
280
- "p": 0.0,
281
- "r": 0.0,
282
- "f": 0.0
283
  },
284
  "orphan": {
285
  "p": 0.0,
@@ -297,32 +297,32 @@
297
  "f": 0.0
298
  }
299
  },
300
- "tag_acc": 0.9312032981,
301
- "lemma_acc": 0.9031031648,
302
- "ents_p": 0.8121504727,
303
- "ents_r": 0.8080541211,
304
- "ents_f": 0.8100971185,
305
  "ents_per_type": {
306
  "PER": {
307
- "p": 0.8685030449,
308
- "r": 0.8787705094,
309
- "f": 0.8736066099
310
  },
311
  "LOC": {
312
- "p": 0.8245838668,
313
- "r": 0.835104158,
314
- "f": 0.8298106698
315
  },
316
  "ORG": {
317
- "p": 0.7541699762,
318
- "r": 0.7248091603,
319
- "f": 0.7391981316
320
  },
321
  "MISC": {
322
- "p": 0.6852231509,
323
- "r": 0.6334742674,
324
- "f": 0.6583333333
325
  }
326
  },
327
- "speed": 4222.2093213177
328
  }
3
  "token_p": 0.9844389844,
4
  "token_r": 0.9896058454,
5
  "token_f": 0.9870156531,
6
+ "pos_acc": 0.9619705246,
7
+ "morph_acc": 0.9527981037,
8
+ "morph_micro_p": 0.9781525017,
9
+ "morph_micro_r": 0.9683197271,
10
+ "morph_micro_f": 0.9732112788,
11
  "morph_per_feat": {
12
  "Definite": {
13
+ "p": 0.9868517166,
14
+ "r": 0.9861313869,
15
+ "f": 0.9864914202
16
  },
17
  "Number": {
18
+ "p": 0.9914609244,
19
+ "r": 0.9832474227,
20
+ "f": 0.9873370922
21
  },
22
  "PronType": {
23
+ "p": 0.9922829582,
24
+ "r": 0.9872040947,
25
+ "f": 0.9897370109
26
  },
27
  "Gender": {
28
+ "p": 0.9663930221,
29
+ "r": 0.9626884743,
30
+ "f": 0.9645371911
31
  },
32
  "Mood": {
33
+ "p": 0.9611829945,
34
+ "r": 0.9236234458,
35
+ "f": 0.9420289855
36
  },
37
  "Person": {
38
+ "p": 0.9804177546,
39
+ "r": 0.9446540881,
40
+ "f": 0.9622037156
41
  },
42
  "Tense": {
43
+ "p": 0.946930281,
44
+ "r": 0.9295199183,
45
+ "f": 0.9381443299
46
  },
47
  "VerbForm": {
48
+ "p": 0.9653716216,
49
+ "r": 0.946192053,
50
+ "f": 0.9556856187
51
  },
52
  "NumType": {
53
+ "p": 0.9721254355,
54
+ "r": 0.95221843,
55
+ "f": 0.9620689655
56
  },
57
  "Reflex": {
58
+ "p": 0.9777777778,
59
  "r": 1.0,
60
+ "f": 0.9887640449
61
  },
62
  "Voice": {
63
+ "p": 0.9043478261,
64
+ "r": 0.9285714286,
65
+ "f": 0.9162995595
66
  },
67
  "Poss": {
68
  "p": 1.0,
70
  "f": 1.0
71
  },
72
  "Polarity": {
73
+ "p": 1.0,
74
+ "r": 0.9882352941,
75
+ "f": 0.9940828402
76
  }
77
  },
78
+ "sents_p": 0.8776978417,
79
+ "sents_r": 0.8883495146,
80
+ "sents_f": 0.8829915561,
81
+ "dep_uas": 0.8747540225,
82
+ "dep_las": 0.8314723749,
83
  "dep_las_per_type": {
84
  "det": {
85
+ "p": 0.9715447154,
86
+ "r": 0.9644874899,
87
+ "f": 0.9680032402
88
  },
89
  "nsubj": {
90
+ "p": 0.8746803069,
91
+ "r": 0.8240963855,
92
+ "f": 0.8486352357
93
  },
94
  "aux:tense": {
95
+ "p": 0.9069767442,
96
+ "r": 0.936,
97
+ "f": 0.9212598425
98
  },
99
  "root": {
100
+ "p": 0.847826087,
101
+ "r": 0.8519417476,
102
+ "f": 0.8498789346
103
  },
104
  "obj": {
105
+ "p": 0.8147058824,
106
  "r": 0.821958457,
107
+ "f": 0.8183161004
108
  },
109
  "cc": {
110
+ "p": 0.876146789,
111
+ "r": 0.8801843318,
112
+ "f": 0.8781609195
113
  },
114
  "case": {
115
+ "p": 0.9633898305,
116
+ "r": 0.9679836512,
117
+ "f": 0.9656812776
118
  },
119
  "obl:mod": {
120
+ "p": 0.6180124224,
121
+ "r": 0.5940298507,
122
+ "f": 0.6057838661
123
  },
124
  "nmod": {
125
+ "p": 0.780148423,
126
+ "r": 0.8401598402,
127
+ "f": 0.809042809
128
  },
129
  "conj": {
130
+ "p": 0.4789272031,
131
+ "r": 0.4921259843,
132
+ "f": 0.4854368932
133
  },
134
  "nummod": {
135
+ "p": 0.9068322981,
136
+ "r": 0.8639053254,
137
+ "f": 0.8848484848
138
  },
139
  "amod": {
140
+ "p": 0.8653136531,
141
+ "r": 0.85428051,
142
+ "f": 0.8597616865
143
  },
144
  "acl": {
145
+ "p": 0.6848484848,
146
+ "r": 0.6531791908,
147
+ "f": 0.6686390533
148
  },
149
  "mark": {
150
+ "p": 0.852173913,
151
+ "r": 0.8634361233,
152
+ "f": 0.8577680525
153
  },
154
  "xcomp": {
155
+ "p": 0.7945205479,
156
+ "r": 0.7682119205,
157
+ "f": 0.7811447811
158
  },
159
  "flat:name": {
160
+ "p": 0.932038835,
161
+ "r": 0.9142857143,
162
+ "f": 0.9230769231
163
  },
164
  "cop": {
165
+ "p": 0.880952381,
166
+ "r": 0.8222222222,
167
+ "f": 0.8505747126
168
  },
169
  "advmod": {
170
+ "p": 0.8096774194,
171
+ "r": 0.7868338558,
172
+ "f": 0.7980922099
173
  },
174
  "obl:arg": {
175
+ "p": 0.6473429952,
176
+ "r": 0.6090909091,
177
+ "f": 0.6276346604
178
  },
179
  "appos": {
180
+ "p": 0.5,
181
+ "r": 0.4578313253,
182
+ "f": 0.4779874214
183
  },
184
  "nsubj:pass": {
185
+ "p": 0.8414634146,
186
+ "r": 0.8117647059,
187
+ "f": 0.8263473054
188
  },
189
  "aux:pass": {
190
+ "p": 0.8833333333,
191
  "r": 0.9464285714,
192
+ "f": 0.9137931034
193
  },
194
  "acl:relcl": {
195
+ "p": 0.6,
196
+ "r": 0.5581395349,
197
+ "f": 0.578313253
198
  },
199
  "advcl": {
200
+ "p": 0.4444444444,
201
+ "r": 0.5128205128,
202
+ "f": 0.4761904762
203
  },
204
  "fixed": {
205
+ "p": 0.7956989247,
206
  "r": 0.74,
207
+ "f": 0.7668393782
208
  },
209
  "dep": {
210
+ "p": 0.2244897959,
211
+ "r": 0.3793103448,
212
+ "f": 0.2820512821
213
  },
214
  "expl:subj": {
215
+ "p": 0.7027027027,
216
+ "r": 0.8125,
217
+ "f": 0.7536231884
218
  },
219
  "expl:comp": {
220
+ "p": 0.6923076923,
221
+ "r": 0.9,
222
+ "f": 0.7826086957
223
  },
224
  "expl:pass": {
225
+ "p": 0.3333333333,
226
  "r": 0.2857142857,
227
+ "f": 0.3076923077
228
  },
229
  "obl:agent": {
230
+ "p": 0.8,
231
  "r": 0.7619047619,
232
+ "f": 0.7804878049
233
  },
234
  "ccomp": {
235
+ "p": 0.6511627907,
236
+ "r": 0.5490196078,
237
+ "f": 0.5957446809
238
  },
239
  "parataxis": {
240
+ "p": 0.5,
241
+ "r": 0.3928571429,
242
+ "f": 0.44
243
  },
244
  "iobj": {
245
+ "p": 0.7222222222,
246
+ "r": 0.52,
247
+ "f": 0.6046511628
248
  },
249
  "nsubj:caus": {
250
  "p": 0.0,
267
  "f": 0.0
268
  },
269
  "vocative": {
270
+ "p": 0.7142857143,
271
  "r": 0.625,
272
+ "f": 0.6666666667
273
  },
274
  "dislocated": {
275
  "p": 0.0,
277
  "f": 0.0
278
  },
279
  "flat:foreign": {
280
+ "p": 1.0,
281
+ "r": 0.2857142857,
282
+ "f": 0.4444444444
283
  },
284
  "orphan": {
285
  "p": 0.0,
297
  "f": 0.0
298
  }
299
  },
300
+ "tag_acc": 0.9330104092,
301
+ "lemma_acc": 0.9032059186,
302
+ "ents_p": 0.8116540445,
303
+ "ents_r": 0.8065529803,
304
+ "ents_f": 0.8090954723,
305
  "ents_per_type": {
306
  "PER": {
307
+ "p": 0.8653387572,
308
+ "r": 0.8840008598,
309
+ "f": 0.874570264
310
  },
311
  "LOC": {
312
+ "p": 0.8252660345,
313
+ "r": 0.8337237514,
314
+ "f": 0.8294733337
315
  },
316
  "ORG": {
317
+ "p": 0.7441767868,
318
+ "r": 0.7133587786,
319
+ "f": 0.728441976
320
  },
321
  "MISC": {
322
+ "p": 0.6901544402,
323
+ "r": 0.6254556058,
324
+ "f": 0.6562141491
325
  }
326
  },
327
+ "speed": 4450.1044334226
328
  }
attribute_ruler/patterns CHANGED
Binary files a/attribute_ruler/patterns and b/attribute_ruler/patterns differ
config.cfg CHANGED
@@ -39,8 +39,9 @@ overwrite = true
39
  scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
40
 
41
  [components.morphologizer.model]
42
- @architectures = "spacy.Tagger.v1"
43
  nO = null
 
44
 
45
  [components.morphologizer.model.tok2vec]
46
  @architectures = "spacy.Tok2VecListener.v1"
@@ -70,7 +71,7 @@ nO = null
70
  @architectures = "spacy.MultiHashEmbed.v2"
71
  width = 96
72
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
73
- rows = [5000,2500,2500,2500,100]
74
  include_static_vectors = false
75
 
76
  [components.ner.model.tok2vec.encode]
@@ -108,8 +109,9 @@ overwrite = false
108
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
109
 
110
  [components.senter.model]
111
- @architectures = "spacy.Tagger.v1"
112
  nO = null
 
113
 
114
  [components.senter.model.tok2vec]
115
  @architectures = "spacy.Tok2Vec.v2"
@@ -138,7 +140,7 @@ factory = "tok2vec"
138
  @architectures = "spacy.MultiHashEmbed.v2"
139
  width = ${components.tok2vec.model.encode:width}
140
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
141
- rows = [5000,2500,2500,2500,100]
142
  include_static_vectors = false
143
 
144
  [components.tok2vec.model.encode]
@@ -175,7 +177,7 @@ dropout = 0.1
175
  accumulate_gradient = 1
176
  patience = 5000
177
  max_epochs = 0
178
- max_steps = 0
179
  eval_frequency = 1000
180
  frozen_components = []
181
  before_to_disk = null
39
  scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
40
 
41
  [components.morphologizer.model]
42
+ @architectures = "spacy.Tagger.v2"
43
  nO = null
44
+ normalize = false
45
 
46
  [components.morphologizer.model.tok2vec]
47
  @architectures = "spacy.Tok2VecListener.v1"
71
  @architectures = "spacy.MultiHashEmbed.v2"
72
  width = 96
73
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
74
+ rows = [5000,1000,2500,2500,50]
75
  include_static_vectors = false
76
 
77
  [components.ner.model.tok2vec.encode]
109
  scorer = {"@scorers":"spacy.senter_scorer.v1"}
110
 
111
  [components.senter.model]
112
+ @architectures = "spacy.Tagger.v2"
113
  nO = null
114
+ normalize = false
115
 
116
  [components.senter.model.tok2vec]
117
  @architectures = "spacy.Tok2Vec.v2"
140
  @architectures = "spacy.MultiHashEmbed.v2"
141
  width = ${components.tok2vec.model.encode:width}
142
  attrs = ["NORM","PREFIX","SUFFIX","SHAPE","SPACY"]
143
+ rows = [5000,1000,2500,2500,50]
144
  include_static_vectors = false
145
 
146
  [components.tok2vec.model.encode]
177
  accumulate_gradient = 1
178
  patience = 5000
179
  max_epochs = 0
180
+ max_steps = 100000
181
  eval_frequency = 1000
182
  frozen_components = []
183
  before_to_disk = null
fr_core_news_sm-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ec38039523b1b64535f1c7d11ce45e6629404a50fa083f193d5555d9f6ac1a30
3
- size 17362258
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:78de7071c4989733c9c08469d7b9b01b79ec7feadc9129a07f4fde0db128e036
3
+ size 16269344
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"fr",
3
  "name":"core_news_sm",
4
- "version":"3.2.0",
5
  "description":"French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"LGPL-LR",
10
- "spacy_version":">=3.2.0,<3.3.0",
11
- "spacy_git_version":"bb26550e2",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
@@ -255,10 +255,6 @@
255
  "vocative",
256
  "xcomp"
257
  ],
258
- "senter":[
259
- "I",
260
- "S"
261
- ],
262
  "attribute_ruler":[
263
 
264
  ],
@@ -297,66 +293,66 @@
297
  "token_p":0.9844389844,
298
  "token_r":0.9896058454,
299
  "token_f":0.9870156531,
300
- "pos_acc":0.9600618397,
301
- "morph_acc":0.9492783505,
302
- "morph_micro_p":0.9765677992,
303
- "morph_micro_r":0.9648470818,
304
- "morph_micro_f":0.9706720603,
305
  "morph_per_feat":{
306
  "Definite":{
307
- "p":0.986100951,
308
- "r":0.9839416058,
309
- "f":0.985020095
310
  },
311
  "Number":{
312
- "p":0.9906838085,
313
- "r":0.9788291605,
314
- "f":0.9847208075
315
  },
316
  "PronType":{
317
- "p":0.992916935,
318
- "r":0.9865642994,
319
- "f":0.9897304236
320
  },
321
  "Gender":{
322
- "p":0.9705502454,
323
- "r":0.9601328904,
324
- "f":0.9653134635
325
  },
326
  "Mood":{
327
- "p":0.9575645756,
328
- "r":0.9218472469,
329
- "f":0.9393665158
330
  },
331
  "Person":{
332
- "p":0.9726205997,
333
- "r":0.9383647799,
334
- "f":0.9551856594
335
  },
336
  "Tense":{
337
- "p":0.936918304,
338
- "r":0.9254341164,
339
- "f":0.9311408016
340
  },
341
  "VerbForm":{
342
- "p":0.9538977368,
343
- "r":0.9420529801,
344
- "f":0.947938359
345
  },
346
  "NumType":{
347
- "p":0.9858156028,
348
- "r":0.9488054608,
349
- "f":0.9669565217
350
  },
351
  "Reflex":{
352
- "p":0.9565217391,
353
  "r":1.0,
354
- "f":0.9777777778
355
  },
356
  "Voice":{
357
- "p":0.8429752066,
358
- "r":0.9107142857,
359
- "f":0.8755364807
360
  },
361
  "Poss":{
362
  "p":1.0,
@@ -364,181 +360,181 @@
364
  "f":1.0
365
  },
366
  "Polarity":{
367
- "p":0.9880952381,
368
- "r":0.9764705882,
369
- "f":0.9822485207
370
  }
371
  },
372
- "sents_p":0.8658823529,
373
- "sents_r":0.8932038835,
374
- "sents_f":0.8793309438,
375
- "dep_uas":0.8770041095,
376
- "dep_las":0.832561907,
377
  "dep_las_per_type":{
378
  "det":{
379
- "p":0.9724919094,
380
- "r":0.9701372074,
381
- "f":0.9713131313
382
  },
383
  "nsubj":{
384
- "p":0.8618925831,
385
- "r":0.8120481928,
386
- "f":0.8362282878
387
  },
388
  "aux:tense":{
389
- "p":0.9206349206,
390
- "r":0.928,
391
- "f":0.9243027888
392
  },
393
  "root":{
394
- "p":0.853427896,
395
- "r":0.8762135922,
396
- "f":0.8646706587
397
  },
398
  "obj":{
399
- "p":0.8171091445,
400
  "r":0.821958457,
401
- "f":0.8195266272
402
  },
403
  "cc":{
404
- "p":0.869955157,
405
- "r":0.8940092166,
406
- "f":0.8818181818
407
  },
408
  "case":{
409
- "p":0.9600811908,
410
- "r":0.9666212534,
411
- "f":0.9633401222
412
  },
413
  "obl:mod":{
414
- "p":0.6214511041,
415
- "r":0.5880597015,
416
- "f":0.6042944785
417
  },
418
  "nmod":{
419
- "p":0.7838095238,
420
- "r":0.8221778222,
421
- "f":0.8025353486
422
  },
423
  "conj":{
424
- "p":0.5307692308,
425
- "r":0.5433070866,
426
- "f":0.5369649805
427
  },
428
  "nummod":{
429
- "p":0.9210526316,
430
- "r":0.8284023669,
431
- "f":0.8722741433
432
  },
433
  "amod":{
434
- "p":0.8683729433,
435
- "r":0.8652094718,
436
- "f":0.8667883212
437
  },
438
  "acl":{
439
- "p":0.6411764706,
440
- "r":0.6300578035,
441
- "f":0.6355685131
442
  },
443
  "mark":{
444
- "p":0.9052132701,
445
- "r":0.8414096916,
446
- "f":0.8721461187
447
  },
448
  "xcomp":{
449
- "p":0.8,
450
- "r":0.7947019868,
451
- "f":0.7973421927
452
  },
453
  "flat:name":{
454
- "p":0.8482142857,
455
- "r":0.9047619048,
456
- "f":0.8755760369
457
  },
458
  "cop":{
459
- "p":0.8571428571,
460
- "r":0.8,
461
- "f":0.8275862069
462
  },
463
  "advmod":{
464
- "p":0.8338658147,
465
- "r":0.8181818182,
466
- "f":0.8259493671
467
  },
468
  "obl:arg":{
469
- "p":0.6553398058,
470
- "r":0.6136363636,
471
- "f":0.6338028169
472
  },
473
  "appos":{
474
- "p":0.417721519,
475
- "r":0.3975903614,
476
- "f":0.4074074074
477
  },
478
  "nsubj:pass":{
479
- "p":0.7717391304,
480
- "r":0.8352941176,
481
- "f":0.802259887
482
  },
483
  "aux:pass":{
484
- "p":0.9137931034,
485
  "r":0.9464285714,
486
- "f":0.9298245614
487
  },
488
  "acl:relcl":{
489
- "p":0.5714285714,
490
- "r":0.6046511628,
491
- "f":0.5875706215
492
  },
493
  "advcl":{
494
- "p":0.4929577465,
495
- "r":0.4487179487,
496
- "f":0.4697986577
497
  },
498
  "fixed":{
499
- "p":0.691588785,
500
  "r":0.74,
501
- "f":0.7149758454
502
  },
503
  "dep":{
504
- "p":0.2884615385,
505
- "r":0.5172413793,
506
- "f":0.3703703704
507
  },
508
  "expl:subj":{
509
- "p":0.7058823529,
510
- "r":0.75,
511
- "f":0.7272727273
512
  },
513
  "expl:comp":{
514
- "p":0.7428571429,
515
- "r":0.8666666667,
516
- "f":0.8
517
  },
518
  "expl:pass":{
519
- "p":0.4,
520
  "r":0.2857142857,
521
- "f":0.3333333333
522
  },
523
  "obl:agent":{
524
- "p":0.8205128205,
525
  "r":0.7619047619,
526
- "f":0.7901234568
527
  },
528
  "ccomp":{
529
- "p":0.6603773585,
530
- "r":0.6862745098,
531
- "f":0.6730769231
532
  },
533
  "parataxis":{
534
- "p":0.36,
535
- "r":0.3214285714,
536
- "f":0.3396226415
537
  },
538
  "iobj":{
539
- "p":0.7,
540
- "r":0.56,
541
- "f":0.6222222222
542
  },
543
  "nsubj:caus":{
544
  "p":0.0,
@@ -561,9 +557,9 @@
561
  "f":0.0
562
  },
563
  "vocative":{
564
- "p":1.0,
565
  "r":0.625,
566
- "f":0.7692307692
567
  },
568
  "dislocated":{
569
  "p":0.0,
@@ -571,9 +567,9 @@
571
  "f":0.0
572
  },
573
  "flat:foreign":{
574
- "p":0.0,
575
- "r":0.0,
576
- "f":0.0
577
  },
578
  "orphan":{
579
  "p":0.0,
@@ -591,34 +587,34 @@
591
  "f":0.0
592
  }
593
  },
594
- "tag_acc":0.9312032981,
595
- "lemma_acc":0.9031031648,
596
- "ents_p":0.8121504727,
597
- "ents_r":0.8080541211,
598
- "ents_f":0.8100971185,
599
  "ents_per_type":{
600
  "PER":{
601
- "p":0.8685030449,
602
- "r":0.8787705094,
603
- "f":0.8736066099
604
  },
605
  "LOC":{
606
- "p":0.8245838668,
607
- "r":0.835104158,
608
- "f":0.8298106698
609
  },
610
  "ORG":{
611
- "p":0.7541699762,
612
- "r":0.7248091603,
613
- "f":0.7391981316
614
  },
615
  "MISC":{
616
- "p":0.6852231509,
617
- "r":0.6334742674,
618
- "f":0.6583333333
619
  }
620
  },
621
- "speed":4222.2093213177
622
  },
623
  "sources":[
624
  {
1
  {
2
  "lang":"fr",
3
  "name":"core_news_sm",
4
+ "version":"3.3.0",
5
  "description":"French pipeline optimized for CPU. Components: tok2vec, morphologizer, parser, senter, ner, attribute_ruler, lemmatizer.",
6
  "author":"Explosion",
7
  "email":"contact@explosion.ai",
8
  "url":"https://explosion.ai",
9
  "license":"LGPL-LR",
10
+ "spacy_version":">=3.3.0.dev0,<3.4.0",
11
+ "spacy_git_version":"849bef2de",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
255
  "vocative",
256
  "xcomp"
257
  ],
 
 
 
 
258
  "attribute_ruler":[
259
 
260
  ],
293
  "token_p":0.9844389844,
294
  "token_r":0.9896058454,
295
  "token_f":0.9870156531,
296
+ "pos_acc":0.9619705246,
297
+ "morph_acc":0.9527981037,
298
+ "morph_micro_p":0.9781525017,
299
+ "morph_micro_r":0.9683197271,
300
+ "morph_micro_f":0.9732112788,
301
  "morph_per_feat":{
302
  "Definite":{
303
+ "p":0.9868517166,
304
+ "r":0.9861313869,
305
+ "f":0.9864914202
306
  },
307
  "Number":{
308
+ "p":0.9914609244,
309
+ "r":0.9832474227,
310
+ "f":0.9873370922
311
  },
312
  "PronType":{
313
+ "p":0.9922829582,
314
+ "r":0.9872040947,
315
+ "f":0.9897370109
316
  },
317
  "Gender":{
318
+ "p":0.9663930221,
319
+ "r":0.9626884743,
320
+ "f":0.9645371911
321
  },
322
  "Mood":{
323
+ "p":0.9611829945,
324
+ "r":0.9236234458,
325
+ "f":0.9420289855
326
  },
327
  "Person":{
328
+ "p":0.9804177546,
329
+ "r":0.9446540881,
330
+ "f":0.9622037156
331
  },
332
  "Tense":{
333
+ "p":0.946930281,
334
+ "r":0.9295199183,
335
+ "f":0.9381443299
336
  },
337
  "VerbForm":{
338
+ "p":0.9653716216,
339
+ "r":0.946192053,
340
+ "f":0.9556856187
341
  },
342
  "NumType":{
343
+ "p":0.9721254355,
344
+ "r":0.95221843,
345
+ "f":0.9620689655
346
  },
347
  "Reflex":{
348
+ "p":0.9777777778,
349
  "r":1.0,
350
+ "f":0.9887640449
351
  },
352
  "Voice":{
353
+ "p":0.9043478261,
354
+ "r":0.9285714286,
355
+ "f":0.9162995595
356
  },
357
  "Poss":{
358
  "p":1.0,
360
  "f":1.0
361
  },
362
  "Polarity":{
363
+ "p":1.0,
364
+ "r":0.9882352941,
365
+ "f":0.9940828402
366
  }
367
  },
368
+ "sents_p":0.8776978417,
369
+ "sents_r":0.8883495146,
370
+ "sents_f":0.8829915561,
371
+ "dep_uas":0.8747540225,
372
+ "dep_las":0.8314723749,
373
  "dep_las_per_type":{
374
  "det":{
375
+ "p":0.9715447154,
376
+ "r":0.9644874899,
377
+ "f":0.9680032402
378
  },
379
  "nsubj":{
380
+ "p":0.8746803069,
381
+ "r":0.8240963855,
382
+ "f":0.8486352357
383
  },
384
  "aux:tense":{
385
+ "p":0.9069767442,
386
+ "r":0.936,
387
+ "f":0.9212598425
388
  },
389
  "root":{
390
+ "p":0.847826087,
391
+ "r":0.8519417476,
392
+ "f":0.8498789346
393
  },
394
  "obj":{
395
+ "p":0.8147058824,
396
  "r":0.821958457,
397
+ "f":0.8183161004
398
  },
399
  "cc":{
400
+ "p":0.876146789,
401
+ "r":0.8801843318,
402
+ "f":0.8781609195
403
  },
404
  "case":{
405
+ "p":0.9633898305,
406
+ "r":0.9679836512,
407
+ "f":0.9656812776
408
  },
409
  "obl:mod":{
410
+ "p":0.6180124224,
411
+ "r":0.5940298507,
412
+ "f":0.6057838661
413
  },
414
  "nmod":{
415
+ "p":0.780148423,
416
+ "r":0.8401598402,
417
+ "f":0.809042809
418
  },
419
  "conj":{
420
+ "p":0.4789272031,
421
+ "r":0.4921259843,
422
+ "f":0.4854368932
423
  },
424
  "nummod":{
425
+ "p":0.9068322981,
426
+ "r":0.8639053254,
427
+ "f":0.8848484848
428
  },
429
  "amod":{
430
+ "p":0.8653136531,
431
+ "r":0.85428051,
432
+ "f":0.8597616865
433
  },
434
  "acl":{
435
+ "p":0.6848484848,
436
+ "r":0.6531791908,
437
+ "f":0.6686390533
438
  },
439
  "mark":{
440
+ "p":0.852173913,
441
+ "r":0.8634361233,
442
+ "f":0.8577680525
443
  },
444
  "xcomp":{
445
+ "p":0.7945205479,
446
+ "r":0.7682119205,
447
+ "f":0.7811447811
448
  },
449
  "flat:name":{
450
+ "p":0.932038835,
451
+ "r":0.9142857143,
452
+ "f":0.9230769231
453
  },
454
  "cop":{
455
+ "p":0.880952381,
456
+ "r":0.8222222222,
457
+ "f":0.8505747126
458
  },
459
  "advmod":{
460
+ "p":0.8096774194,
461
+ "r":0.7868338558,
462
+ "f":0.7980922099
463
  },
464
  "obl:arg":{
465
+ "p":0.6473429952,
466
+ "r":0.6090909091,
467
+ "f":0.6276346604
468
  },
469
  "appos":{
470
+ "p":0.5,
471
+ "r":0.4578313253,
472
+ "f":0.4779874214
473
  },
474
  "nsubj:pass":{
475
+ "p":0.8414634146,
476
+ "r":0.8117647059,
477
+ "f":0.8263473054
478
  },
479
  "aux:pass":{
480
+ "p":0.8833333333,
481
  "r":0.9464285714,
482
+ "f":0.9137931034
483
  },
484
  "acl:relcl":{
485
+ "p":0.6,
486
+ "r":0.5581395349,
487
+ "f":0.578313253
488
  },
489
  "advcl":{
490
+ "p":0.4444444444,
491
+ "r":0.5128205128,
492
+ "f":0.4761904762
493
  },
494
  "fixed":{
495
+ "p":0.7956989247,
496
  "r":0.74,
497
+ "f":0.7668393782
498
  },
499
  "dep":{
500
+ "p":0.2244897959,
501
+ "r":0.3793103448,
502
+ "f":0.2820512821
503
  },
504
  "expl:subj":{
505
+ "p":0.7027027027,
506
+ "r":0.8125,
507
+ "f":0.7536231884
508
  },
509
  "expl:comp":{
510
+ "p":0.6923076923,
511
+ "r":0.9,
512
+ "f":0.7826086957
513
  },
514
  "expl:pass":{
515
+ "p":0.3333333333,
516
  "r":0.2857142857,
517
+ "f":0.3076923077
518
  },
519
  "obl:agent":{
520
+ "p":0.8,
521
  "r":0.7619047619,
522
+ "f":0.7804878049
523
  },
524
  "ccomp":{
525
+ "p":0.6511627907,
526
+ "r":0.5490196078,
527
+ "f":0.5957446809
528
  },
529
  "parataxis":{
530
+ "p":0.5,
531
+ "r":0.3928571429,
532
+ "f":0.44
533
  },
534
  "iobj":{
535
+ "p":0.7222222222,
536
+ "r":0.52,
537
+ "f":0.6046511628
538
  },
539
  "nsubj:caus":{
540
  "p":0.0,
557
  "f":0.0
558
  },
559
  "vocative":{
560
+ "p":0.7142857143,
561
  "r":0.625,
562
+ "f":0.6666666667
563
  },
564
  "dislocated":{
565
  "p":0.0,
567
  "f":0.0
568
  },
569
  "flat:foreign":{
570
+ "p":1.0,
571
+ "r":0.2857142857,
572
+ "f":0.4444444444
573
  },
574
  "orphan":{
575
  "p":0.0,
587
  "f":0.0
588
  }
589
  },
590
+ "tag_acc":0.9330104092,
591
+ "lemma_acc":0.9032059186,
592
+ "ents_p":0.8116540445,
593
+ "ents_r":0.8065529803,
594
+ "ents_f":0.8090954723,
595
  "ents_per_type":{
596
  "PER":{
597
+ "p":0.8653387572,
598
+ "r":0.8840008598,
599
+ "f":0.874570264
600
  },
601
  "LOC":{
602
+ "p":0.8252660345,
603
+ "r":0.8337237514,
604
+ "f":0.8294733337
605
  },
606
  "ORG":{
607
+ "p":0.7441767868,
608
+ "r":0.7133587786,
609
+ "f":0.728441976
610
  },
611
  "MISC":{
612
+ "p":0.6901544402,
613
+ "r":0.6254556058,
614
+ "f":0.6562141491
615
  }
616
  },
617
+ "speed":4450.1044334226
618
  },
619
  "sources":[
620
  {
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0ed2c0bd002aa9a2099a471459930a7d41e0c7d063b2f767d27667c494d94cd7
3
- size 76433
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7dbe5cf70ad7f34890679369a1fb3be6e4e6711524cc27a30262d6c31b9bbcec
3
+ size 76485
ner/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5642d3b4a41fa4f2701b68bf8bc9826e8a9dfd6f3274c54818e38eb3ebb3b8cc
3
- size 6865402
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:25af1410b45dee99e2eda9362804894d31fa7be0c1f0a88d3f7b68ca9affa85b
3
+ size 6270202
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:afe203976b3194284f25142c5309dad5e51bc9dc5008f932c168b6d3766ec63e
3
  size 304828
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4178d4b5db29f103a2438f24a7e7bff1f828a350298180af4b00b2fab7c61ec4
3
  size 304828
parser/moves CHANGED
@@ -1 +1 @@
1
- ��moves��{"0":{"":25255},"1":{"":21680},"2":{"case":7258,"det":6062,"nsubj":1982,"punct":1645,"advmod":1210,"cc":1205,"mark":1051,"aux:tense":673,"amod":662,"nummod":595,"aux:pass":544,"obl:mod":483,"nsubj:pass":425,"cop":365,"expl:comp":204,"obj":170,"expl:subj":164,"iobj":139,"advcl":123,"nmod":92,"expl:pass":40,"vocative":35,"dep":0},"3":{"nmod":5132,"punct":3954,"amod":2083,"conj":1517,"obj":1410,"obl:mod":1184,"obl:arg":1078,"acl":782,"xcomp":739,"flat:name":657,"advmod":562,"fixed":409,"appos":408,"acl:relcl":365,"advcl":306,"ccomp":238,"obl:agent":206,"dep":138,"nummod":117,"parataxis":92,"nsubj":75,"flat:foreign":63},"4":{"ROOT":2219}}�cfg��neg_key�
1
+ ��moves��{"0":{"":25345},"1":{"":21571},"2":{"case":7318,"det":6066,"nsubj":1969,"punct":1660,"cc":1214,"advmod":1209,"mark":1055,"aux:tense":673,"amod":664,"nummod":609,"aux:pass":546,"obl:mod":480,"nsubj:pass":420,"cop":366,"expl:comp":204,"obj":170,"expl:subj":165,"iobj":139,"advcl":123,"nmod":92,"expl:pass":40,"vocative":35,"dep":0},"3":{"nmod":4995,"punct":4040,"amod":2051,"conj":1514,"obj":1405,"obl:mod":1188,"obl:arg":1070,"acl":785,"xcomp":739,"flat:name":622,"advmod":564,"fixed":413,"appos":412,"acl:relcl":368,"advcl":306,"ccomp":238,"obl:agent":203,"dep":142,"nummod":124,"parataxis":95,"nsubj":76,"flat:foreign":59},"4":{"ROOT":2231}}�cfg��neg_key�
senter/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7934162f365e3ac2d016179f61edefcba32f9afcd1c8df3ef7b384bbea4a8258
3
- size 197037
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1d1237795d42c750d60cc1817e7afb2b62c410c076c9cec22781fd28d96a22e4
3
+ size 197089
tok2vec/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:93e52299d1575b49f9e73fe15021c36d5a5c6357b8ad0e93e3512e38e0a06d07
3
- size 6734429
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6609774a4a94c1ed0810e85960a0a25544a3303076223fb3cba906198807af51
3
+ size 6139229
tokenizer CHANGED
The diff for this file is too large to render. See raw diff
vocab/key2row CHANGED
@@ -1 +1,3 @@
1
-
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:76be8b528d0075f7aae98d6fa57a6d3c83ae480a8469e668d7b0af968995ac71
3
+ size 1