KoichiYasuoka committed on
Commit 811a77a · 1 Parent(s): 4581850

initial release

Files changed (8)
  1. README.md +31 -0
  2. config.json +1404 -0
  3. maker.py +56 -0
  4. pytorch_model.bin +3 -0
  5. special_tokens_map.json +56 -0
  6. tokenizer.json +3 -0
  7. tokenizer_config.json +0 -0
  8. ud.py +63 -0
README.md ADDED
@@ -0,0 +1,31 @@
+ ---
+ language:
+ - "th"
+ tags:
+ - "thai"
+ - "token-classification"
+ - "pos"
+ - "dependency-parsing"
+ base_model: KoichiYasuoka/camembert-thai-base-upos
+ datasets:
+ - "universal_dependencies"
+ license: "apache-2.0"
+ pipeline_tag: "token-classification"
+ widget:
+ - text: "หลายหัวดีกว่าหัวเดียว"
+ ---
+
+ # camembert-thai-base-ud-goeswith
+
+ ## Model Description
+
+ This is a CamemBERT model pre-trained on Thai texts for POS-tagging and dependency-parsing (using `goeswith` for subwords), derived from [camembert-thai-base-upos](https://huggingface.co/KoichiYasuoka/camembert-thai-base-upos).
+
+ ## How to Use
+
+ ```
+ from transformers import pipeline
+ nlp=pipeline("universal-dependencies","KoichiYasuoka/camembert-thai-base-ud-goeswith",trust_remote_code=True,aggregation_strategy="simple")
+ print(nlp("หลายหัวดีกว่าหัวเดียว"))
+ ```
+
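Note on the label scheme: the class labels added in `config.json` in this commit (for example `ADP|_|case` or `PART|NounType=Class|PartType=Emp|Prefix=Yes|mark`) appear to pack three fields into one string: a UPOS tag, a feature list (`_` when empty), and a dependency relation. The sketch below splits such a label under that assumption. The `UDLabel` type and the `split_label` and `invert_id2label` helpers are hypothetical names made up for illustration; they are not part of `ud.py` or of `transformers`, and the reading of the three fields is inferred from the label list alone.

```python
from typing import Dict, NamedTuple


class UDLabel(NamedTuple):
    """Hypothetical container for one decoded label (illustration only)."""
    upos: str    # first field, e.g. "NOUN"
    feats: str   # middle field(s), "_" when no features are given
    deprel: str  # last field, e.g. "nsubj"


def split_label(label: str) -> UDLabel:
    """Split a label assumed to have the shape UPOS|FEATS|DEPREL.

    FEATS may itself contain "|" (several Name=Value pairs), so the
    first and last fields are peeled off and the rest is re-joined.
    """
    parts = label.split("|")
    if len(parts) < 3:
        raise ValueError(f"unexpected label shape: {label!r}")
    return UDLabel(parts[0], "|".join(parts[1:-1]), parts[-1])


def invert_id2label(id2label: Dict[str, str]) -> Dict[str, int]:
    """Build a label2id-style mapping from an id2label-style mapping."""
    return {label: int(idx) for idx, label in id2label.items()}


# A few entries copied from the id2label table in config.json.
SAMPLE_ID2LABEL = {
    "0": "-|_|dep",
    "9": "ADP|_|case",
    "301": "PART|NounType=Class|PartType=Emp|Prefix=Yes|mark",
    "683": "X|_|goeswith",
}

if __name__ == "__main__":
    for idx, label in SAMPLE_ID2LABEL.items():
        print(idx, split_label(label))
```

The same split could be applied to the full `id2label` table after loading `config.json` with the standard `json` module. `invert_id2label` shows how the `label2id` table relates to `id2label`, assuming the two are exact inverses, as they appear to be in this diff.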
config.json ADDED
@@ -0,0 +1,1404 @@
+ {
+ "architectures": [
+ "CamembertForTokenClassification"
+ ],
+ "attention_probs_dropout_prob": 0.1,
+ "bos_token_id": 0,
+ "classifier_dropout": null,
+ "custom_pipelines": {
+ "universal-dependencies": {
+ "impl": "ud.UniversalDependenciesPipeline"
+ }
+ },
+ "eos_token_id": 2,
+ "hidden_act": "gelu",
+ "hidden_dropout_prob": 0.1,
+ "hidden_size": 768,
+ "id2label": {
+ "0": "-|_|dep",
+ "1": "ADP|Foreign=Yes|case",
+ "2": "ADP|NounType=Class|case",
+ "3": "ADP|Prefix=Yes|case",
+ "4": "ADP|Prefix=Yes|mark",
+ "5": "ADP|_|acl",
+ "6": "ADP|_|advcl",
+ "7": "ADP|_|advmod",
+ "8": "ADP|_|appos",
+ "9": "ADP|_|case",
+ "10": "ADP|_|cc",
+ "11": "ADP|_|cc:preconj",
+ "12": "ADP|_|compound",
+ "13": "ADP|_|conj",
+ "14": "ADP|_|csubj",
+ "15": "ADP|_|dep",
+ "16": "ADP|_|fixed",
+ "17": "ADP|_|flat",
+ "18": "ADP|_|mark",
+ "19": "ADP|_|nmod",
+ "20": "ADP|_|nsubj",
+ "21": "ADP|_|obl",
+ "22": "ADP|_|orphan",
+ "23": "ADP|_|root",
+ "24": "ADV|Foreign=Yes|advmod",
+ "25": "ADV|NumType=Mult|advmod",
+ "26": "ADV|PartType=Adv|advmod",
+ "27": "ADV|PartType=Adv|mark",
+ "28": "ADV|PartType=Enp|advmod",
+ "29": "ADV|PartType=Int|advmod",
+ "30": "ADV|PartType=Int|fixed",
+ "31": "ADV|Prefix=Yes|advmod",
+ "32": "ADV|Prefix=Yes|mark",
+ "33": "ADV|PronType=Int|advmod",
+ "34": "ADV|_|acl",
+ "35": "ADV|_|advcl",
+ "36": "ADV|_|advmod",
+ "37": "ADV|_|aux",
+ "38": "ADV|_|case",
+ "39": "ADV|_|cc",
+ "40": "ADV|_|ccomp",
+ "41": "ADV|_|compound",
+ "42": "ADV|_|conj",
+ "43": "ADV|_|dep",
+ "44": "ADV|_|det",
+ "45": "ADV|_|discourse",
+ "46": "ADV|_|fixed",
+ "47": "ADV|_|flat",
+ "48": "ADV|_|mark",
+ "49": "ADV|_|nmod",
+ "50": "ADV|_|obj",
+ "51": "ADV|_|orphan",
+ "52": "ADV|_|root",
+ "53": "ADV|_|xcomp",
+ "54": "AUX|Foreign=Yes|aux",
+ "55": "AUX|NounType=Class|appos",
+ "56": "AUX|Prefix=Yes|aux",
+ "57": "AUX|VerbType=Cop|acl",
+ "58": "AUX|VerbType=Cop|advcl",
+ "59": "AUX|VerbType=Cop|aux",
+ "60": "AUX|VerbType=Cop|conj",
+ "61": "AUX|VerbType=Cop|cop",
+ "62": "AUX|VerbType=Cop|mark",
+ "63": "AUX|VerbType=Cop|root",
+ "64": "AUX|_|acl",
+ "65": "AUX|_|advmod",
+ "66": "AUX|_|aux",
+ "67": "AUX|_|aux:pass",
+ "68": "AUX|_|ccomp",
+ "69": "AUX|_|clf",
+ "70": "AUX|_|compound",
+ "71": "AUX|_|conj",
+ "72": "AUX|_|cop",
+ "73": "AUX|_|fixed",
+ "74": "AUX|_|mark",
+ "75": "AUX|_|root",
+ "76": "CCONJ|Foreign=Yes|cc",
+ "77": "CCONJ|PronType=Prs|cc",
+ "78": "CCONJ|_|advmod",
+ "79": "CCONJ|_|case",
+ "80": "CCONJ|_|cc",
+ "81": "CCONJ|_|compound",
+ "82": "CCONJ|_|conj",
+ "83": "CCONJ|_|discourse",
+ "84": "CCONJ|_|fixed",
+ "85": "CCONJ|_|flat",
+ "86": "CCONJ|_|mark",
+ "87": "CCONJ|_|nsubj",
+ "88": "CCONJ|_|obj",
+ "89": "CCONJ|_|obl",
+ "90": "CCONJ|_|orphan",
+ "91": "CCONJ|_|root",
+ "92": "DET|NumType=Mult|det",
+ "93": "DET|PartType=Emp|det",
+ "94": "DET|PartType=Int|det",
+ "95": "DET|PronType=Int|det",
+ "96": "DET|_|advmod",
+ "97": "DET|_|case",
+ "98": "DET|_|cc:preconj",
+ "99": "DET|_|compound",
+ "100": "DET|_|conj",
+ "101": "DET|_|dep",
+ "102": "DET|_|det",
+ "103": "DET|_|det:predet",
+ "104": "DET|_|discourse",
+ "105": "DET|_|fixed",
+ "106": "DET|_|flat",
+ "107": "DET|_|list",
+ "108": "DET|_|mark",
+ "109": "DET|_|nmod",
+ "110": "DET|_|nsubj",
+ "111": "DET|_|nsubj:pass",
+ "112": "DET|_|nummod",
+ "113": "DET|_|obj",
+ "114": "DET|_|obl",
+ "115": "DET|_|obl:tmod",
+ "116": "DET|_|orphan",
+ "117": "DET|_|root",
+ "118": "INTJ|_|acl",
+ "119": "INTJ|_|nsubj",
+ "120": "INTJ|_|root",
+ "121": "NOUN|Abbr=Yes|Foreign=Yes|nmod",
+ "122": "NOUN|Abbr=Yes|Prefix=Yes|flat",
+ "123": "NOUN|Abbr=Yes|acl",
+ "124": "NOUN|Abbr=Yes|appos",
+ "125": "NOUN|Abbr=Yes|clf",
+ "126": "NOUN|Abbr=Yes|conj",
+ "127": "NOUN|Abbr=Yes|fixed",
+ "128": "NOUN|Abbr=Yes|flat",
+ "129": "NOUN|Abbr=Yes|nmod",
+ "130": "NOUN|Abbr=Yes|nsubj",
+ "131": "NOUN|Abbr=Yes|obj",
+ "132": "NOUN|Abbr=Yes|obl",
+ "133": "NOUN|Foreign=Yes|NounType=Class|clf",
+ "134": "NOUN|Foreign=Yes|NounType=Class|obj",
+ "135": "NOUN|Foreign=Yes|Prefix=Yes|appos",
+ "136": "NOUN|Foreign=Yes|Prefix=Yes|flat",
+ "137": "NOUN|Foreign=Yes|acl",
+ "138": "NOUN|Foreign=Yes|advcl",
+ "139": "NOUN|Foreign=Yes|advmod",
+ "140": "NOUN|Foreign=Yes|appos",
+ "141": "NOUN|Foreign=Yes|ccomp",
+ "142": "NOUN|Foreign=Yes|clf",
+ "143": "NOUN|Foreign=Yes|compound",
+ "144": "NOUN|Foreign=Yes|conj",
+ "145": "NOUN|Foreign=Yes|dislocated",
+ "146": "NOUN|Foreign=Yes|flat",
+ "147": "NOUN|Foreign=Yes|iobj",
+ "148": "NOUN|Foreign=Yes|list",
+ "149": "NOUN|Foreign=Yes|nmod",
+ "150": "NOUN|Foreign=Yes|nsubj",
+ "151": "NOUN|Foreign=Yes|obj",
+ "152": "NOUN|Foreign=Yes|obl",
+ "153": "NOUN|Foreign=Yes|root",
+ "154": "NOUN|Foreign=Yes|xcomp",
+ "155": "NOUN|NameType=Com|nmod",
+ "156": "NOUN|NameType=Geo|nmod",
+ "157": "NOUN|NameType=Geo|nsubj",
+ "158": "NOUN|NameType=Geo|obj",
+ "159": "NOUN|NameType=Nat|nmod",
+ "160": "NOUN|NameType=Oth|conj",
+ "161": "NOUN|NameType=Oth|flat",
+ "162": "NOUN|NameType=Oth|nmod",
+ "163": "NOUN|NameType=Oth|nsubj",
+ "164": "NOUN|NameType=Pro|nmod",
+ "165": "NOUN|NameType=Prs|nmod",
+ "166": "NOUN|NameType=Prs|nsubj",
+ "167": "NOUN|NounType=Class|Prefix=Yes|advcl",
+ "168": "NOUN|NounType=Class|Prefix=Yes|advmod",
+ "169": "NOUN|NounType=Class|Prefix=Yes|clf",
+ "170": "NOUN|NounType=Class|Prefix=Yes|mark",
+ "171": "NOUN|NounType=Class|Prefix=Yes|nmod",
+ "172": "NOUN|NounType=Class|Prefix=Yes|nsubj",
+ "173": "NOUN|NounType=Class|Prefix=Yes|obj",
+ "174": "NOUN|NounType=Class|acl",
+ "175": "NOUN|NounType=Class|advcl",
+ "176": "NOUN|NounType=Class|advmod",
+ "177": "NOUN|NounType=Class|appos",
+ "178": "NOUN|NounType=Class|cc",
+ "179": "NOUN|NounType=Class|ccomp",
+ "180": "NOUN|NounType=Class|clf",
+ "181": "NOUN|NounType=Class|compound",
+ "182": "NOUN|NounType=Class|conj",
+ "183": "NOUN|NounType=Class|dislocated",
+ "184": "NOUN|NounType=Class|fixed",
+ "185": "NOUN|NounType=Class|flat",
+ "186": "NOUN|NounType=Class|iobj",
+ "187": "NOUN|NounType=Class|list",
+ "188": "NOUN|NounType=Class|nmod",
+ "189": "NOUN|NounType=Class|nsubj",
+ "190": "NOUN|NounType=Class|nummod",
+ "191": "NOUN|NounType=Class|obj",
+ "192": "NOUN|NounType=Class|obl",
+ "193": "NOUN|NounType=Class|orphan",
+ "194": "NOUN|NounType=Class|root",
+ "195": "NOUN|NounType=Class|xcomp",
+ "196": "NOUN|NumType=Mult|advcl",
+ "197": "NOUN|NumType=Mult|nmod",
+ "198": "NOUN|NumType=Mult|obj",
+ "199": "NOUN|PartType=Enp|obj",
+ "200": "NOUN|PartType=Enp|obl",
+ "201": "NOUN|PartType=Int|obj",
+ "202": "NOUN|PartType=Res|nmod",
+ "203": "NOUN|PartType=Res|obj",
+ "204": "NOUN|Prefix=Yes|acl",
+ "205": "NOUN|Prefix=Yes|advcl",
+ "206": "NOUN|Prefix=Yes|advmod",
+ "207": "NOUN|Prefix=Yes|appos",
+ "208": "NOUN|Prefix=Yes|case",
+ "209": "NOUN|Prefix=Yes|cc",
+ "210": "NOUN|Prefix=Yes|ccomp",
+ "211": "NOUN|Prefix=Yes|clf",
+ "212": "NOUN|Prefix=Yes|compound",
+ "213": "NOUN|Prefix=Yes|conj",
+ "214": "NOUN|Prefix=Yes|csubj",
+ "215": "NOUN|Prefix=Yes|dislocated",
+ "216": "NOUN|Prefix=Yes|fixed",
+ "217": "NOUN|Prefix=Yes|flat",
+ "218": "NOUN|Prefix=Yes|iobj",
+ "219": "NOUN|Prefix=Yes|list",
+ "220": "NOUN|Prefix=Yes|nmod",
+ "221": "NOUN|Prefix=Yes|nsubj",
+ "222": "NOUN|Prefix=Yes|nummod",
+ "223": "NOUN|Prefix=Yes|obj",
+ "224": "NOUN|Prefix=Yes|obl",
+ "225": "NOUN|Prefix=Yes|orphan",
+ "226": "NOUN|Prefix=Yes|root",
+ "227": "NOUN|Prefix=Yes|xcomp",
+ "228": "NOUN|_|acl",
+ "229": "NOUN|_|acl:relcl",
+ "230": "NOUN|_|advcl",
+ "231": "NOUN|_|advmod",
+ "232": "NOUN|_|appos",
+ "233": "NOUN|_|aux",
+ "234": "NOUN|_|case",
+ "235": "NOUN|_|cc",
+ "236": "NOUN|_|ccomp",
+ "237": "NOUN|_|clf",
+ "238": "NOUN|_|compound",
+ "239": "NOUN|_|conj",
+ "240": "NOUN|_|cop",
+ "241": "NOUN|_|csubj",
+ "242": "NOUN|_|discourse",
+ "243": "NOUN|_|dislocated",
+ "244": "NOUN|_|expl",
+ "245": "NOUN|_|fixed",
+ "246": "NOUN|_|flat",
+ "247": "NOUN|_|flat:name",
+ "248": "NOUN|_|iobj",
+ "249": "NOUN|_|list",
+ "250": "NOUN|_|mark",
+ "251": "NOUN|_|nmod",
+ "252": "NOUN|_|nmod:poss",
+ "253": "NOUN|_|nsubj",
+ "254": "NOUN|_|nsubj:pass",
+ "255": "NOUN|_|nummod",
+ "256": "NOUN|_|obj",
+ "257": "NOUN|_|obl",
+ "258": "NOUN|_|obl:poss",
+ "259": "NOUN|_|obl:tmod",
+ "260": "NOUN|_|orphan",
+ "261": "NOUN|_|parataxis",
+ "262": "NOUN|_|root",
+ "263": "NOUN|_|vocative",
+ "264": "NOUN|_|xcomp",
+ "265": "NUM|Abbr=Yes|flat",
+ "266": "NUM|Abbr=Yes|nummod",
+ "267": "NUM|Abbr=Yes|obj",
+ "268": "NUM|Foreign=Yes|clf",
+ "269": "NUM|NumType=Mult|advmod",
+ "270": "NUM|NumType=Mult|nummod",
+ "271": "NUM|Prefix=Yes|nummod",
+ "272": "NUM|_|acl",
+ "273": "NUM|_|acl:relcl",
+ "274": "NUM|_|advcl",
+ "275": "NUM|_|advmod",
+ "276": "NUM|_|appos",
+ "277": "NUM|_|case",
+ "278": "NUM|_|ccomp",
+ "279": "NUM|_|clf",
+ "280": "NUM|_|compound",
+ "281": "NUM|_|conj",
+ "282": "NUM|_|dep",
+ "283": "NUM|_|det",
+ "284": "NUM|_|fixed",
+ "285": "NUM|_|flat",
+ "286": "NUM|_|flat:name",
+ "287": "NUM|_|iobj",
+ "288": "NUM|_|nmod",
+ "289": "NUM|_|nsubj",
+ "290": "NUM|_|nummod",
+ "291": "NUM|_|obj",
+ "292": "NUM|_|obl",
+ "293": "NUM|_|obl:poss",
+ "294": "NUM|_|obl:tmod",
+ "295": "NUM|_|root",
+ "296": "NUM|_|xcomp",
+ "297": "PART|Aspect=Perf|aux",
+ "298": "PART|Aspect=Perf|xcomp",
+ "299": "PART|Aspect=Prog|aux",
+ "300": "PART|NameType=Oth|advmod",
+ "301": "PART|NounType=Class|PartType=Emp|Prefix=Yes|mark",
+ "302": "PART|NounType=Class|PartType=Emp|mark",
+ "303": "PART|NounType=Class|Prefix=Yes|mark",
+ "304": "PART|NumType=Mult|PartType=Emp|mark",
+ "305": "PART|PartType=Adj|acl",
+ "306": "PART|PartType=Adj|compound",
+ "307": "PART|PartType=Adj|mark",
+ "308": "PART|PartType=Adj|nmod",
+ "309": "PART|PartType=Adj|orphan",
+ "310": "PART|PartType=Adv|advmod",
+ "311": "PART|PartType=Adv|mark",
+ "312": "PART|PartType=Emp|Prefix=Yes|advmod",
+ "313": "PART|PartType=Emp|Prefix=Yes|aux",
+ "314": "PART|PartType=Emp|Prefix=Yes|mark",
+ "315": "PART|PartType=Emp|acl",
+ "316": "PART|PartType=Emp|advmod",
+ "317": "PART|PartType=Emp|aux",
+ "318": "PART|PartType=Emp|case",
+ "319": "PART|PartType=Emp|compound",
+ "320": "PART|PartType=Emp|det",
+ "321": "PART|PartType=Emp|discourse",
+ "322": "PART|PartType=Emp|fixed",
+ "323": "PART|PartType=Emp|mark",
+ "324": "PART|PartType=Emp|nmod",
+ "325": "PART|PartType=Enp|acl",
+ "326": "PART|PartType=Enp|advmod",
+ "327": "PART|PartType=Enp|compound",
+ "328": "PART|PartType=Enp|dep",
+ "329": "PART|PartType=Enp|det",
+ "330": "PART|PartType=Enp|discourse",
+ "331": "PART|PartType=Enp|fixed",
+ "332": "PART|PartType=Enp|obl",
+ "333": "PART|PartType=Int|acl",
+ "334": "PART|PartType=Int|advmod",
+ "335": "PART|PartType=Int|dep",
+ "336": "PART|PartType=Int|discourse",
+ "337": "PART|PartType=Int|mark",
+ "338": "PART|PartType=Int|nmod",
+ "339": "PART|PartType=Int|obj",
+ "340": "PART|PartType=Int|obl",
+ "341": "PART|PartType=Neg|acl",
+ "342": "PART|PartType=Neg|advcl",
+ "343": "PART|PartType=Neg|advmod",
+ "344": "PART|PartType=Neg|aux",
+ "345": "PART|PartType=Neg|fixed",
+ "346": "PART|PartType=Neg|mark",
+ "347": "PART|PartType=Res|advmod",
+ "348": "PART|PartType=Res|discourse",
+ "349": "PART|PartType=Res|fixed",
+ "350": "PART|Polarity=Neg|advmod",
+ "351": "PART|Prefix=Yes|acl",
+ "352": "PART|Prefix=Yes|advmod",
+ "353": "PART|Prefix=Yes|aux",
+ "354": "PART|Prefix=Yes|mark",
+ "355": "PART|Prefix=Yes|nmod",
+ "356": "PART|PronType=Int|acl",
+ "357": "PART|PronType=Int|advmod",
+ "358": "PART|PronType=Int|discourse",
+ "359": "PART|PronType=Int|obj",
+ "360": "PART|PronType=Int|root",
+ "361": "PART|_|acl",
+ "362": "PART|_|advmod",
+ "363": "PART|_|aux",
+ "364": "PART|_|cc",
+ "365": "PART|_|cc:preconj",
+ "366": "PART|_|ccomp",
+ "367": "PART|_|clf",
+ "368": "PART|_|compound",
+ "369": "PART|_|compound:prt",
+ "370": "PART|_|conj",
+ "371": "PART|_|discourse",
+ "372": "PART|_|fixed",
+ "373": "PART|_|mark",
+ "374": "PART|_|nmod",
+ "375": "PART|_|nmod:poss",
+ "376": "PART|_|nsubj",
+ "377": "PART|_|obj",
+ "378": "PART|_|obl",
+ "379": "PART|_|root",
+ "380": "PRON|NounType=Class|clf",
+ "381": "PRON|Person=1|compound",
+ "382": "PRON|Person=1|nmod:poss",
+ "383": "PRON|Person=1|nsubj",
+ "384": "PRON|Person=1|nsubj:pass",
+ "385": "PRON|Person=1|obj",
+ "386": "PRON|Person=1|obl",
+ "387": "PRON|Person=1|obl:poss",
+ "388": "PRON|Person=2|compound",
+ "389": "PRON|Person=2|nmod:poss",
+ "390": "PRON|Person=2|nsubj",
+ "391": "PRON|Person=2|obj",
+ "392": "PRON|Person=2|obl",
+ "393": "PRON|Person=3|advmod",
+ "394": "PRON|Person=3|appos",
+ "395": "PRON|Person=3|compound",
+ "396": "PRON|Person=3|conj",
+ "397": "PRON|Person=3|nmod",
+ "398": "PRON|Person=3|nmod:poss",
+ "399": "PRON|Person=3|nsubj",
+ "400": "PRON|Person=3|nsubj:pass",
+ "401": "PRON|Person=3|obj",
+ "402": "PRON|Person=3|obl",
+ "403": "PRON|Person=3|obl:poss",
+ "404": "PRON|Person=3|reparandum",
+ "405": "PRON|Person=3|xcomp",
+ "406": "PRON|PronType=Int|nsubj",
+ "407": "PRON|PronType=Int|obj",
+ "408": "PRON|PronType=Int|obl",
+ "409": "PRON|PronType=Int|root",
+ "410": "PRON|PronType=Prs|advcl",
+ "411": "PRON|PronType=Prs|advmod",
+ "412": "PRON|PronType=Prs|ccomp",
+ "413": "PRON|PronType=Prs|clf",
+ "414": "PRON|PronType=Prs|conj",
+ "415": "PRON|PronType=Prs|expl",
+ "416": "PRON|PronType=Prs|nmod",
+ "417": "PRON|PronType=Prs|nsubj",
+ "418": "PRON|PronType=Prs|obj",
+ "419": "PRON|PronType=Prs|obl",
+ "420": "PRON|PronType=Prs|root",
+ "421": "PRON|PronType=Rcp|advmod",
+ "422": "PRON|PronType=Rcp|iobj",
+ "423": "PRON|PronType=Rcp|nmod",
+ "424": "PRON|PronType=Rcp|obj",
+ "425": "PRON|PronType=Rcp|obl",
+ "426": "PRON|_|acl",
+ "427": "PRON|_|acl:relcl",
+ "428": "PRON|_|advcl",
+ "429": "PRON|_|advmod",
+ "430": "PRON|_|appos",
+ "431": "PRON|_|ccomp",
+ "432": "PRON|_|compound",
+ "433": "PRON|_|conj",
+ "434": "PRON|_|csubj",
+ "435": "PRON|_|det",
+ "436": "PRON|_|discourse",
+ "437": "PRON|_|dislocated",
+ "438": "PRON|_|expl",
+ "439": "PRON|_|fixed",
+ "440": "PRON|_|flat",
+ "441": "PRON|_|iobj",
+ "442": "PRON|_|mark",
+ "443": "PRON|_|nmod",
+ "444": "PRON|_|nmod:poss",
+ "445": "PRON|_|nsubj",
+ "446": "PRON|_|obj",
+ "447": "PRON|_|obl",
+ "448": "PRON|_|obl:poss",
+ "449": "PRON|_|root",
+ "450": "PRON|_|xcomp",
+ "451": "PROPN|Abbr=Yes|Foreign=Yes|NameType=Oth|obj",
+ "452": "PROPN|Abbr=Yes|NameType=Com|advmod",
+ "453": "PROPN|Abbr=Yes|NameType=Com|nmod",
+ "454": "PROPN|Abbr=Yes|nmod",
+ "455": "PROPN|Abbr=Yes|nsubj",
+ "456": "PROPN|Foreign=Yes|NameType=Com|list",
+ "457": "PROPN|Foreign=Yes|NameType=Com|nmod",
+ "458": "PROPN|Foreign=Yes|NameType=Com|nsubj",
+ "459": "PROPN|Foreign=Yes|NameType=Com|obl",
+ "460": "PROPN|Foreign=Yes|NameType=Geo|obj",
+ "461": "PROPN|Foreign=Yes|NameType=Geo|obl",
+ "462": "PROPN|Foreign=Yes|NameType=Giv|nsubj",
+ "463": "PROPN|Foreign=Yes|NameType=Oth|conj",
+ "464": "PROPN|Foreign=Yes|NameType=Oth|flat",
+ "465": "PROPN|Foreign=Yes|NameType=Oth|nmod",
+ "466": "PROPN|Foreign=Yes|NameType=Prs|conj",
+ "467": "PROPN|Foreign=Yes|NameType=Prs|flat",
+ "468": "PROPN|Foreign=Yes|NameType=Prs|nmod",
+ "469": "PROPN|Foreign=Yes|NameType=Prs|nsubj",
+ "470": "PROPN|Foreign=Yes|NameType=Prs|obj",
+ "471": "PROPN|Foreign=Yes|NameType=Prs|obl",
+ "472": "PROPN|Foreign=Yes|NameType=Sur|flat",
+ "473": "PROPN|Foreign=Yes|appos",
+ "474": "PROPN|Foreign=Yes|ccomp",
+ "475": "PROPN|Foreign=Yes|compound",
+ "476": "PROPN|Foreign=Yes|conj",
+ "477": "PROPN|Foreign=Yes|flat",
+ "478": "PROPN|Foreign=Yes|iobj",
+ "479": "PROPN|Foreign=Yes|list",
+ "480": "PROPN|Foreign=Yes|nmod",
+ "481": "PROPN|Foreign=Yes|nsubj",
+ "482": "PROPN|Foreign=Yes|obj",
+ "483": "PROPN|Foreign=Yes|obl",
+ "484": "PROPN|Foreign=Yes|root",
+ "485": "PROPN|NameType=Com|appos",
+ "486": "PROPN|NameType=Com|conj",
+ "487": "PROPN|NameType=Com|flat",
+ "488": "PROPN|NameType=Com|list",
+ "489": "PROPN|NameType=Com|nmod",
+ "490": "PROPN|NameType=Com|nsubj",
+ "491": "PROPN|NameType=Com|obj",
+ "492": "PROPN|NameType=Com|obl",
+ "493": "PROPN|NameType=Geo|compound",
+ "494": "PROPN|NameType=Geo|conj",
+ "495": "PROPN|NameType=Geo|flat",
+ "496": "PROPN|NameType=Geo|list",
+ "497": "PROPN|NameType=Geo|nmod",
+ "498": "PROPN|NameType=Geo|nsubj",
+ "499": "PROPN|NameType=Geo|nummod",
+ "500": "PROPN|NameType=Geo|obj",
+ "501": "PROPN|NameType=Geo|obl",
+ "502": "PROPN|NameType=Geo|root",
+ "503": "PROPN|NameType=Giv|acl",
+ "504": "PROPN|NameType=Giv|appos",
+ "505": "PROPN|NameType=Giv|ccomp",
+ "506": "PROPN|NameType=Giv|conj",
+ "507": "PROPN|NameType=Giv|dislocated",
+ "508": "PROPN|NameType=Giv|flat",
+ "509": "PROPN|NameType=Giv|list",
+ "510": "PROPN|NameType=Giv|nmod",
+ "511": "PROPN|NameType=Giv|nsubj",
+ "512": "PROPN|NameType=Giv|obj",
+ "513": "PROPN|NameType=Giv|obl",
+ "514": "PROPN|NameType=Giv|root",
+ "515": "PROPN|NameType=Nat|acl",
+ "516": "PROPN|NameType=Nat|appos",
+ "517": "PROPN|NameType=Nat|compound",
+ "518": "PROPN|NameType=Nat|conj",
+ "519": "PROPN|NameType=Nat|csubj",
+ "520": "PROPN|NameType=Nat|flat",
+ "521": "PROPN|NameType=Nat|list",
+ "522": "PROPN|NameType=Nat|nmod",
+ "523": "PROPN|NameType=Nat|nsubj",
+ "524": "PROPN|NameType=Nat|nummod",
+ "525": "PROPN|NameType=Nat|obj",
+ "526": "PROPN|NameType=Nat|obl",
+ "527": "PROPN|NameType=Oth|acl",
+ "528": "PROPN|NameType=Oth|appos",
+ "529": "PROPN|NameType=Oth|compound",
+ "530": "PROPN|NameType=Oth|conj",
+ "531": "PROPN|NameType=Oth|dislocated",
+ "532": "PROPN|NameType=Oth|flat",
+ "533": "PROPN|NameType=Oth|nmod",
+ "534": "PROPN|NameType=Oth|nsubj",
+ "535": "PROPN|NameType=Oth|obj",
+ "536": "PROPN|NameType=Oth|obl",
+ "537": "PROPN|NameType=Oth|root",
+ "538": "PROPN|NameType=Pro|advcl",
+ "539": "PROPN|NameType=Pro|flat",
+ "540": "PROPN|NameType=Pro|nmod",
+ "541": "PROPN|NameType=Pro|nsubj",
+ "542": "PROPN|NameType=Pro|obj",
+ "543": "PROPN|NameType=Pro|obl",
+ "544": "PROPN|NameType=Prs|conj",
+ "545": "PROPN|NameType=Prs|discourse",
+ "546": "PROPN|NameType=Prs|dislocated",
+ "547": "PROPN|NameType=Prs|flat",
+ "548": "PROPN|NameType=Prs|list",
+ "549": "PROPN|NameType=Prs|nmod",
+ "550": "PROPN|NameType=Prs|nsubj",
+ "551": "PROPN|NameType=Prs|obj",
+ "552": "PROPN|NameType=Prs|obl",
+ "553": "PROPN|NameType=Prs|vocative",
+ "554": "PROPN|NameType=Sur|flat",
+ "555": "PROPN|NameType=Sur|nmod",
+ "556": "PROPN|NameType=Sur|nsubj",
+ "557": "PROPN|NounType=Class|clf",
+ "558": "PROPN|Prefix=Yes|nmod",
+ "559": "PROPN|Prefix=Yes|nsubj",
+ "560": "PROPN|_|acl",
+ "561": "PROPN|_|acl:relcl",
+ "562": "PROPN|_|advmod",
+ "563": "PROPN|_|appos",
+ "564": "PROPN|_|aux",
+ "565": "PROPN|_|cc",
+ "566": "PROPN|_|ccomp",
+ "567": "PROPN|_|clf",
+ "568": "PROPN|_|compound",
+ "569": "PROPN|_|conj",
+ "570": "PROPN|_|fixed",
+ "571": "PROPN|_|flat",
+ "572": "PROPN|_|flat:name",
+ "573": "PROPN|_|goeswith",
+ "574": "PROPN|_|iobj",
+ "575": "PROPN|_|list",
+ "576": "PROPN|_|nmod",
+ "577": "PROPN|_|nmod:poss",
+ "578": "PROPN|_|nsubj",
+ "579": "PROPN|_|nsubj:pass",
+ "580": "PROPN|_|obj",
+ "581": "PROPN|_|obl",
+ "582": "PROPN|_|obl:poss",
+ "583": "PROPN|_|obl:tmod",
+ "584": "PROPN|_|root",
+ "585": "PROPN|_|xcomp",
+ "586": "PUNCT|NounType=Class|punct",
+ "587": "PUNCT|_|advmod",
+ "588": "PUNCT|_|clf",
+ "589": "PUNCT|_|dep",
+ "590": "PUNCT|_|punct",
+ "591": "PUNCT|_|root",
+ "592": "SCONJ|NumType=Mult|mark",
+ "593": "SCONJ|Prefix=Yes|cc",
+ "594": "SCONJ|Prefix=Yes|mark",
+ "595": "SCONJ|VerbType=Cop|mark",
+ "596": "SCONJ|_|advcl",
+ "597": "SCONJ|_|advmod",
+ "598": "SCONJ|_|case",
+ "599": "SCONJ|_|cc",
+ "600": "SCONJ|_|compound",
+ "601": "SCONJ|_|discourse",
+ "602": "SCONJ|_|fixed",
+ "603": "SCONJ|_|flat",
+ "604": "SCONJ|_|mark",
+ "605": "SCONJ|_|nsubj",
+ "606": "SCONJ|_|orphan",
+ "607": "SCONJ|_|root",
+ "608": "SYM|_|advmod",
+ "609": "SYM|_|clf",
+ "610": "SYM|_|dep",
+ "611": "SYM|_|nmod",
+ "612": "SYM|_|nsubj",
+ "613": "SYM|_|obj",
+ "614": "SYM|_|obl",
+ "615": "SYM|_|xcomp",
+ "616": "VERB|Abbr=Yes|acl",
+ "617": "VERB|Foreign=Yes|acl",
+ "618": "VERB|Foreign=Yes|advcl",
+ "619": "VERB|Foreign=Yes|ccomp",
+ "620": "VERB|Foreign=Yes|compound",
+ "621": "VERB|Foreign=Yes|conj",
+ "622": "VERB|Foreign=Yes|flat",
+ "623": "VERB|Foreign=Yes|nmod",
+ "624": "VERB|Foreign=Yes|nsubj",
+ "625": "VERB|Foreign=Yes|root",
+ "626": "VERB|Foreign=Yes|xcomp",
+ "627": "VERB|Mood=Imp|aux",
+ "628": "VERB|Mood=Imp|xcomp",
+ "629": "VERB|NounType=Class|acl",
+ "630": "VERB|NounType=Class|compound",
+ "631": "VERB|PartType=Adj|acl",
+ "632": "VERB|Prefix=Yes|acl",
+ "633": "VERB|Prefix=Yes|advcl",
+ "634": "VERB|Prefix=Yes|ccomp",
+ "635": "VERB|Prefix=Yes|compound",
+ "636": "VERB|Prefix=Yes|conj",
+ "637": "VERB|Prefix=Yes|nsubj",
+ "638": "VERB|Prefix=Yes|parataxis",
+ "639": "VERB|Prefix=Yes|root",
+ "640": "VERB|VerbType=Cop|acl",
+ "641": "VERB|VerbType=Cop|advcl",
+ "642": "VERB|VerbType=Cop|advmod",
+ "643": "VERB|VerbType=Cop|ccomp",
+ "644": "VERB|VerbType=Cop|compound",
+ "645": "VERB|VerbType=Cop|cop",
+ "646": "VERB|VerbType=Cop|parataxis",
+ "647": "VERB|VerbType=Cop|root",
+ "648": "VERB|Voice=Pass|aux",
+ "649": "VERB|Voice=Pass|aux:pass",
+ "650": "VERB|_|acl",
+ "651": "VERB|_|acl:relcl",
+ "652": "VERB|_|advcl",
+ "653": "VERB|_|advmod",
+ "654": "VERB|_|appos",
+ "655": "VERB|_|aux",
+ "656": "VERB|_|case",
+ "657": "VERB|_|cc",
+ "658": "VERB|_|ccomp",
+ "659": "VERB|_|clf",
+ "660": "VERB|_|compound",
+ "661": "VERB|_|conj",
+ "662": "VERB|_|cop",
+ "663": "VERB|_|csubj",
+ "664": "VERB|_|dep",
+ "665": "VERB|_|det",
+ "666": "VERB|_|discourse",
+ "667": "VERB|_|dislocated",
+ "668": "VERB|_|fixed",
+ "669": "VERB|_|flat",
+ "670": "VERB|_|list",
+ "671": "VERB|_|mark",
+ "672": "VERB|_|nmod",
+ "673": "VERB|_|nmod:poss",
+ "674": "VERB|_|nsubj",
+ "675": "VERB|_|obj",
+ "676": "VERB|_|obl",
+ "677": "VERB|_|obl:poss",
+ "678": "VERB|_|orphan",
+ "679": "VERB|_|parataxis",
+ "680": "VERB|_|punct",
+ "681": "VERB|_|root",
+ "682": "VERB|_|xcomp",
+ "683": "X|_|goeswith"
+ },
703
+ "initializer_range": 0.02,
704
+ "intermediate_size": 3072,
705
+ "label2id": {
706
+ "-|_|dep": 0,
707
+ "ADP|Foreign=Yes|case": 1,
708
+ "ADP|NounType=Class|case": 2,
709
+ "ADP|Prefix=Yes|case": 3,
710
+ "ADP|Prefix=Yes|mark": 4,
711
+ "ADP|_|acl": 5,
712
+ "ADP|_|advcl": 6,
713
+ "ADP|_|advmod": 7,
714
+ "ADP|_|appos": 8,
715
+ "ADP|_|case": 9,
716
+ "ADP|_|cc": 10,
717
+ "ADP|_|cc:preconj": 11,
718
+ "ADP|_|compound": 12,
719
+ "ADP|_|conj": 13,
720
+ "ADP|_|csubj": 14,
721
+ "ADP|_|dep": 15,
722
+ "ADP|_|fixed": 16,
723
+ "ADP|_|flat": 17,
724
+ "ADP|_|mark": 18,
725
+ "ADP|_|nmod": 19,
726
+ "ADP|_|nsubj": 20,
727
+ "ADP|_|obl": 21,
728
+ "ADP|_|orphan": 22,
729
+ "ADP|_|root": 23,
730
+ "ADV|Foreign=Yes|advmod": 24,
731
+ "ADV|NumType=Mult|advmod": 25,
732
+ "ADV|PartType=Adv|advmod": 26,
733
+ "ADV|PartType=Adv|mark": 27,
734
+ "ADV|PartType=Enp|advmod": 28,
735
+ "ADV|PartType=Int|advmod": 29,
736
+ "ADV|PartType=Int|fixed": 30,
737
+ "ADV|Prefix=Yes|advmod": 31,
738
+ "ADV|Prefix=Yes|mark": 32,
739
+ "ADV|PronType=Int|advmod": 33,
740
+ "ADV|_|acl": 34,
741
+ "ADV|_|advcl": 35,
742
+ "ADV|_|advmod": 36,
743
+ "ADV|_|aux": 37,
744
+ "ADV|_|case": 38,
745
+ "ADV|_|cc": 39,
746
+ "ADV|_|ccomp": 40,
747
+ "ADV|_|compound": 41,
748
+ "ADV|_|conj": 42,
749
+ "ADV|_|dep": 43,
750
+ "ADV|_|det": 44,
751
+ "ADV|_|discourse": 45,
752
+ "ADV|_|fixed": 46,
753
+ "ADV|_|flat": 47,
754
+ "ADV|_|mark": 48,
755
+ "ADV|_|nmod": 49,
756
+ "ADV|_|obj": 50,
757
+ "ADV|_|orphan": 51,
758
+ "ADV|_|root": 52,
759
+ "ADV|_|xcomp": 53,
760
+ "AUX|Foreign=Yes|aux": 54,
761
+ "AUX|NounType=Class|appos": 55,
762
+ "AUX|Prefix=Yes|aux": 56,
763
+ "AUX|VerbType=Cop|acl": 57,
764
+ "AUX|VerbType=Cop|advcl": 58,
765
+ "AUX|VerbType=Cop|aux": 59,
766
+ "AUX|VerbType=Cop|conj": 60,
767
+ "AUX|VerbType=Cop|cop": 61,
768
+ "AUX|VerbType=Cop|mark": 62,
769
+ "AUX|VerbType=Cop|root": 63,
770
+ "AUX|_|acl": 64,
771
+ "AUX|_|advmod": 65,
772
+ "AUX|_|aux": 66,
773
+ "AUX|_|aux:pass": 67,
774
+ "AUX|_|ccomp": 68,
775
+ "AUX|_|clf": 69,
776
+ "AUX|_|compound": 70,
777
+ "AUX|_|conj": 71,
778
+ "AUX|_|cop": 72,
779
+ "AUX|_|fixed": 73,
780
+ "AUX|_|mark": 74,
781
+ "AUX|_|root": 75,
782
+ "CCONJ|Foreign=Yes|cc": 76,
783
+ "CCONJ|PronType=Prs|cc": 77,
784
+ "CCONJ|_|advmod": 78,
785
+ "CCONJ|_|case": 79,
786
+ "CCONJ|_|cc": 80,
787
+ "CCONJ|_|compound": 81,
788
+ "CCONJ|_|conj": 82,
789
+ "CCONJ|_|discourse": 83,
790
+ "CCONJ|_|fixed": 84,
791
+ "CCONJ|_|flat": 85,
792
+ "CCONJ|_|mark": 86,
793
+ "CCONJ|_|nsubj": 87,
794
+ "CCONJ|_|obj": 88,
795
+ "CCONJ|_|obl": 89,
796
+ "CCONJ|_|orphan": 90,
797
+ "CCONJ|_|root": 91,
+    "DET|NumType=Mult|det": 92,
+    "DET|PartType=Emp|det": 93,
+    "DET|PartType=Int|det": 94,
+    "DET|PronType=Int|det": 95,
+    "DET|_|advmod": 96,
+    "DET|_|case": 97,
+    "DET|_|cc:preconj": 98,
+    "DET|_|compound": 99,
+    "DET|_|conj": 100,
+    "DET|_|dep": 101,
+    "DET|_|det": 102,
+    "DET|_|det:predet": 103,
+    "DET|_|discourse": 104,
+    "DET|_|fixed": 105,
+    "DET|_|flat": 106,
+    "DET|_|list": 107,
+    "DET|_|mark": 108,
+    "DET|_|nmod": 109,
+    "DET|_|nsubj": 110,
+    "DET|_|nsubj:pass": 111,
+    "DET|_|nummod": 112,
+    "DET|_|obj": 113,
+    "DET|_|obl": 114,
+    "DET|_|obl:tmod": 115,
+    "DET|_|orphan": 116,
+    "DET|_|root": 117,
+    "INTJ|_|acl": 118,
+    "INTJ|_|nsubj": 119,
+    "INTJ|_|root": 120,
+    "NOUN|Abbr=Yes|Foreign=Yes|nmod": 121,
+    "NOUN|Abbr=Yes|Prefix=Yes|flat": 122,
+    "NOUN|Abbr=Yes|acl": 123,
+    "NOUN|Abbr=Yes|appos": 124,
+    "NOUN|Abbr=Yes|clf": 125,
+    "NOUN|Abbr=Yes|conj": 126,
+    "NOUN|Abbr=Yes|fixed": 127,
+    "NOUN|Abbr=Yes|flat": 128,
+    "NOUN|Abbr=Yes|nmod": 129,
+    "NOUN|Abbr=Yes|nsubj": 130,
+    "NOUN|Abbr=Yes|obj": 131,
+    "NOUN|Abbr=Yes|obl": 132,
+    "NOUN|Foreign=Yes|NounType=Class|clf": 133,
+    "NOUN|Foreign=Yes|NounType=Class|obj": 134,
+    "NOUN|Foreign=Yes|Prefix=Yes|appos": 135,
+    "NOUN|Foreign=Yes|Prefix=Yes|flat": 136,
+    "NOUN|Foreign=Yes|acl": 137,
+    "NOUN|Foreign=Yes|advcl": 138,
+    "NOUN|Foreign=Yes|advmod": 139,
+    "NOUN|Foreign=Yes|appos": 140,
+    "NOUN|Foreign=Yes|ccomp": 141,
+    "NOUN|Foreign=Yes|clf": 142,
+    "NOUN|Foreign=Yes|compound": 143,
+    "NOUN|Foreign=Yes|conj": 144,
+    "NOUN|Foreign=Yes|dislocated": 145,
+    "NOUN|Foreign=Yes|flat": 146,
+    "NOUN|Foreign=Yes|iobj": 147,
+    "NOUN|Foreign=Yes|list": 148,
+    "NOUN|Foreign=Yes|nmod": 149,
+    "NOUN|Foreign=Yes|nsubj": 150,
+    "NOUN|Foreign=Yes|obj": 151,
+    "NOUN|Foreign=Yes|obl": 152,
+    "NOUN|Foreign=Yes|root": 153,
+    "NOUN|Foreign=Yes|xcomp": 154,
+    "NOUN|NameType=Com|nmod": 155,
+    "NOUN|NameType=Geo|nmod": 156,
+    "NOUN|NameType=Geo|nsubj": 157,
+    "NOUN|NameType=Geo|obj": 158,
+    "NOUN|NameType=Nat|nmod": 159,
+    "NOUN|NameType=Oth|conj": 160,
+    "NOUN|NameType=Oth|flat": 161,
+    "NOUN|NameType=Oth|nmod": 162,
+    "NOUN|NameType=Oth|nsubj": 163,
+    "NOUN|NameType=Pro|nmod": 164,
+    "NOUN|NameType=Prs|nmod": 165,
+    "NOUN|NameType=Prs|nsubj": 166,
+    "NOUN|NounType=Class|Prefix=Yes|advcl": 167,
+    "NOUN|NounType=Class|Prefix=Yes|advmod": 168,
+    "NOUN|NounType=Class|Prefix=Yes|clf": 169,
+    "NOUN|NounType=Class|Prefix=Yes|mark": 170,
+    "NOUN|NounType=Class|Prefix=Yes|nmod": 171,
+    "NOUN|NounType=Class|Prefix=Yes|nsubj": 172,
+    "NOUN|NounType=Class|Prefix=Yes|obj": 173,
+    "NOUN|NounType=Class|acl": 174,
+    "NOUN|NounType=Class|advcl": 175,
+    "NOUN|NounType=Class|advmod": 176,
+    "NOUN|NounType=Class|appos": 177,
+    "NOUN|NounType=Class|cc": 178,
+    "NOUN|NounType=Class|ccomp": 179,
+    "NOUN|NounType=Class|clf": 180,
+    "NOUN|NounType=Class|compound": 181,
+    "NOUN|NounType=Class|conj": 182,
+    "NOUN|NounType=Class|dislocated": 183,
+    "NOUN|NounType=Class|fixed": 184,
+    "NOUN|NounType=Class|flat": 185,
+    "NOUN|NounType=Class|iobj": 186,
+    "NOUN|NounType=Class|list": 187,
+    "NOUN|NounType=Class|nmod": 188,
+    "NOUN|NounType=Class|nsubj": 189,
+    "NOUN|NounType=Class|nummod": 190,
+    "NOUN|NounType=Class|obj": 191,
+    "NOUN|NounType=Class|obl": 192,
+    "NOUN|NounType=Class|orphan": 193,
+    "NOUN|NounType=Class|root": 194,
+    "NOUN|NounType=Class|xcomp": 195,
+    "NOUN|NumType=Mult|advcl": 196,
+    "NOUN|NumType=Mult|nmod": 197,
+    "NOUN|NumType=Mult|obj": 198,
+    "NOUN|PartType=Enp|obj": 199,
+    "NOUN|PartType=Enp|obl": 200,
+    "NOUN|PartType=Int|obj": 201,
+    "NOUN|PartType=Res|nmod": 202,
+    "NOUN|PartType=Res|obj": 203,
+    "NOUN|Prefix=Yes|acl": 204,
+    "NOUN|Prefix=Yes|advcl": 205,
+    "NOUN|Prefix=Yes|advmod": 206,
+    "NOUN|Prefix=Yes|appos": 207,
+    "NOUN|Prefix=Yes|case": 208,
+    "NOUN|Prefix=Yes|cc": 209,
+    "NOUN|Prefix=Yes|ccomp": 210,
+    "NOUN|Prefix=Yes|clf": 211,
+    "NOUN|Prefix=Yes|compound": 212,
+    "NOUN|Prefix=Yes|conj": 213,
+    "NOUN|Prefix=Yes|csubj": 214,
+    "NOUN|Prefix=Yes|dislocated": 215,
+    "NOUN|Prefix=Yes|fixed": 216,
+    "NOUN|Prefix=Yes|flat": 217,
+    "NOUN|Prefix=Yes|iobj": 218,
+    "NOUN|Prefix=Yes|list": 219,
+    "NOUN|Prefix=Yes|nmod": 220,
+    "NOUN|Prefix=Yes|nsubj": 221,
+    "NOUN|Prefix=Yes|nummod": 222,
+    "NOUN|Prefix=Yes|obj": 223,
+    "NOUN|Prefix=Yes|obl": 224,
+    "NOUN|Prefix=Yes|orphan": 225,
+    "NOUN|Prefix=Yes|root": 226,
+    "NOUN|Prefix=Yes|xcomp": 227,
+    "NOUN|_|acl": 228,
+    "NOUN|_|acl:relcl": 229,
+    "NOUN|_|advcl": 230,
+    "NOUN|_|advmod": 231,
+    "NOUN|_|appos": 232,
+    "NOUN|_|aux": 233,
+    "NOUN|_|case": 234,
+    "NOUN|_|cc": 235,
+    "NOUN|_|ccomp": 236,
+    "NOUN|_|clf": 237,
+    "NOUN|_|compound": 238,
+    "NOUN|_|conj": 239,
+    "NOUN|_|cop": 240,
+    "NOUN|_|csubj": 241,
+    "NOUN|_|discourse": 242,
+    "NOUN|_|dislocated": 243,
+    "NOUN|_|expl": 244,
+    "NOUN|_|fixed": 245,
+    "NOUN|_|flat": 246,
+    "NOUN|_|flat:name": 247,
+    "NOUN|_|iobj": 248,
+    "NOUN|_|list": 249,
+    "NOUN|_|mark": 250,
+    "NOUN|_|nmod": 251,
+    "NOUN|_|nmod:poss": 252,
+    "NOUN|_|nsubj": 253,
+    "NOUN|_|nsubj:pass": 254,
+    "NOUN|_|nummod": 255,
+    "NOUN|_|obj": 256,
+    "NOUN|_|obl": 257,
+    "NOUN|_|obl:poss": 258,
+    "NOUN|_|obl:tmod": 259,
+    "NOUN|_|orphan": 260,
+    "NOUN|_|parataxis": 261,
+    "NOUN|_|root": 262,
+    "NOUN|_|vocative": 263,
+    "NOUN|_|xcomp": 264,
+    "NUM|Abbr=Yes|flat": 265,
+    "NUM|Abbr=Yes|nummod": 266,
+    "NUM|Abbr=Yes|obj": 267,
+    "NUM|Foreign=Yes|clf": 268,
+    "NUM|NumType=Mult|advmod": 269,
+    "NUM|NumType=Mult|nummod": 270,
+    "NUM|Prefix=Yes|nummod": 271,
+    "NUM|_|acl": 272,
+    "NUM|_|acl:relcl": 273,
+    "NUM|_|advcl": 274,
+    "NUM|_|advmod": 275,
+    "NUM|_|appos": 276,
+    "NUM|_|case": 277,
+    "NUM|_|ccomp": 278,
+    "NUM|_|clf": 279,
+    "NUM|_|compound": 280,
+    "NUM|_|conj": 281,
+    "NUM|_|dep": 282,
+    "NUM|_|det": 283,
+    "NUM|_|fixed": 284,
+    "NUM|_|flat": 285,
+    "NUM|_|flat:name": 286,
+    "NUM|_|iobj": 287,
+    "NUM|_|nmod": 288,
+    "NUM|_|nsubj": 289,
+    "NUM|_|nummod": 290,
+    "NUM|_|obj": 291,
+    "NUM|_|obl": 292,
+    "NUM|_|obl:poss": 293,
+    "NUM|_|obl:tmod": 294,
+    "NUM|_|root": 295,
+    "NUM|_|xcomp": 296,
+    "PART|Aspect=Perf|aux": 297,
+    "PART|Aspect=Perf|xcomp": 298,
+    "PART|Aspect=Prog|aux": 299,
+    "PART|NameType=Oth|advmod": 300,
+    "PART|NounType=Class|PartType=Emp|Prefix=Yes|mark": 301,
+    "PART|NounType=Class|PartType=Emp|mark": 302,
+    "PART|NounType=Class|Prefix=Yes|mark": 303,
+    "PART|NumType=Mult|PartType=Emp|mark": 304,
+    "PART|PartType=Adj|acl": 305,
+    "PART|PartType=Adj|compound": 306,
+    "PART|PartType=Adj|mark": 307,
+    "PART|PartType=Adj|nmod": 308,
+    "PART|PartType=Adj|orphan": 309,
+    "PART|PartType=Adv|advmod": 310,
+    "PART|PartType=Adv|mark": 311,
+    "PART|PartType=Emp|Prefix=Yes|advmod": 312,
+    "PART|PartType=Emp|Prefix=Yes|aux": 313,
+    "PART|PartType=Emp|Prefix=Yes|mark": 314,
+    "PART|PartType=Emp|acl": 315,
+    "PART|PartType=Emp|advmod": 316,
+    "PART|PartType=Emp|aux": 317,
+    "PART|PartType=Emp|case": 318,
+    "PART|PartType=Emp|compound": 319,
+    "PART|PartType=Emp|det": 320,
+    "PART|PartType=Emp|discourse": 321,
+    "PART|PartType=Emp|fixed": 322,
+    "PART|PartType=Emp|mark": 323,
+    "PART|PartType=Emp|nmod": 324,
+    "PART|PartType=Enp|acl": 325,
+    "PART|PartType=Enp|advmod": 326,
+    "PART|PartType=Enp|compound": 327,
+    "PART|PartType=Enp|dep": 328,
+    "PART|PartType=Enp|det": 329,
+    "PART|PartType=Enp|discourse": 330,
+    "PART|PartType=Enp|fixed": 331,
+    "PART|PartType=Enp|obl": 332,
+    "PART|PartType=Int|acl": 333,
+    "PART|PartType=Int|advmod": 334,
+    "PART|PartType=Int|dep": 335,
+    "PART|PartType=Int|discourse": 336,
+    "PART|PartType=Int|mark": 337,
+    "PART|PartType=Int|nmod": 338,
+    "PART|PartType=Int|obj": 339,
+    "PART|PartType=Int|obl": 340,
+    "PART|PartType=Neg|acl": 341,
+    "PART|PartType=Neg|advcl": 342,
+    "PART|PartType=Neg|advmod": 343,
+    "PART|PartType=Neg|aux": 344,
+    "PART|PartType=Neg|fixed": 345,
+    "PART|PartType=Neg|mark": 346,
+    "PART|PartType=Res|advmod": 347,
+    "PART|PartType=Res|discourse": 348,
+    "PART|PartType=Res|fixed": 349,
+    "PART|Polarity=Neg|advmod": 350,
+    "PART|Prefix=Yes|acl": 351,
+    "PART|Prefix=Yes|advmod": 352,
+    "PART|Prefix=Yes|aux": 353,
+    "PART|Prefix=Yes|mark": 354,
+    "PART|Prefix=Yes|nmod": 355,
+    "PART|PronType=Int|acl": 356,
+    "PART|PronType=Int|advmod": 357,
+    "PART|PronType=Int|discourse": 358,
+    "PART|PronType=Int|obj": 359,
+    "PART|PronType=Int|root": 360,
+    "PART|_|acl": 361,
+    "PART|_|advmod": 362,
+    "PART|_|aux": 363,
+    "PART|_|cc": 364,
+    "PART|_|cc:preconj": 365,
+    "PART|_|ccomp": 366,
+    "PART|_|clf": 367,
+    "PART|_|compound": 368,
+    "PART|_|compound:prt": 369,
+    "PART|_|conj": 370,
+    "PART|_|discourse": 371,
+    "PART|_|fixed": 372,
+    "PART|_|mark": 373,
+    "PART|_|nmod": 374,
+    "PART|_|nmod:poss": 375,
+    "PART|_|nsubj": 376,
+    "PART|_|obj": 377,
+    "PART|_|obl": 378,
+    "PART|_|root": 379,
+    "PRON|NounType=Class|clf": 380,
+    "PRON|Person=1|compound": 381,
+    "PRON|Person=1|nmod:poss": 382,
+    "PRON|Person=1|nsubj": 383,
+    "PRON|Person=1|nsubj:pass": 384,
+    "PRON|Person=1|obj": 385,
+    "PRON|Person=1|obl": 386,
+    "PRON|Person=1|obl:poss": 387,
+    "PRON|Person=2|compound": 388,
+    "PRON|Person=2|nmod:poss": 389,
+    "PRON|Person=2|nsubj": 390,
+    "PRON|Person=2|obj": 391,
+    "PRON|Person=2|obl": 392,
+    "PRON|Person=3|advmod": 393,
+    "PRON|Person=3|appos": 394,
+    "PRON|Person=3|compound": 395,
+    "PRON|Person=3|conj": 396,
+    "PRON|Person=3|nmod": 397,
+    "PRON|Person=3|nmod:poss": 398,
+    "PRON|Person=3|nsubj": 399,
+    "PRON|Person=3|nsubj:pass": 400,
+    "PRON|Person=3|obj": 401,
+    "PRON|Person=3|obl": 402,
+    "PRON|Person=3|obl:poss": 403,
+    "PRON|Person=3|reparandum": 404,
+    "PRON|Person=3|xcomp": 405,
+    "PRON|PronType=Int|nsubj": 406,
+    "PRON|PronType=Int|obj": 407,
+    "PRON|PronType=Int|obl": 408,
+    "PRON|PronType=Int|root": 409,
+    "PRON|PronType=Prs|advcl": 410,
+    "PRON|PronType=Prs|advmod": 411,
+    "PRON|PronType=Prs|ccomp": 412,
+    "PRON|PronType=Prs|clf": 413,
+    "PRON|PronType=Prs|conj": 414,
+    "PRON|PronType=Prs|expl": 415,
+    "PRON|PronType=Prs|nmod": 416,
+    "PRON|PronType=Prs|nsubj": 417,
+    "PRON|PronType=Prs|obj": 418,
+    "PRON|PronType=Prs|obl": 419,
+    "PRON|PronType=Prs|root": 420,
+    "PRON|PronType=Rcp|advmod": 421,
+    "PRON|PronType=Rcp|iobj": 422,
+    "PRON|PronType=Rcp|nmod": 423,
+    "PRON|PronType=Rcp|obj": 424,
+    "PRON|PronType=Rcp|obl": 425,
+    "PRON|_|acl": 426,
+    "PRON|_|acl:relcl": 427,
+    "PRON|_|advcl": 428,
+    "PRON|_|advmod": 429,
+    "PRON|_|appos": 430,
+    "PRON|_|ccomp": 431,
+    "PRON|_|compound": 432,
+    "PRON|_|conj": 433,
+    "PRON|_|csubj": 434,
+    "PRON|_|det": 435,
+    "PRON|_|discourse": 436,
+    "PRON|_|dislocated": 437,
+    "PRON|_|expl": 438,
+    "PRON|_|fixed": 439,
+    "PRON|_|flat": 440,
+    "PRON|_|iobj": 441,
+    "PRON|_|mark": 442,
+    "PRON|_|nmod": 443,
+    "PRON|_|nmod:poss": 444,
+    "PRON|_|nsubj": 445,
+    "PRON|_|obj": 446,
+    "PRON|_|obl": 447,
+    "PRON|_|obl:poss": 448,
+    "PRON|_|root": 449,
+    "PRON|_|xcomp": 450,
+    "PROPN|Abbr=Yes|Foreign=Yes|NameType=Oth|obj": 451,
+    "PROPN|Abbr=Yes|NameType=Com|advmod": 452,
+    "PROPN|Abbr=Yes|NameType=Com|nmod": 453,
+    "PROPN|Abbr=Yes|nmod": 454,
+    "PROPN|Abbr=Yes|nsubj": 455,
+    "PROPN|Foreign=Yes|NameType=Com|list": 456,
+    "PROPN|Foreign=Yes|NameType=Com|nmod": 457,
+    "PROPN|Foreign=Yes|NameType=Com|nsubj": 458,
+    "PROPN|Foreign=Yes|NameType=Com|obl": 459,
+    "PROPN|Foreign=Yes|NameType=Geo|obj": 460,
+    "PROPN|Foreign=Yes|NameType=Geo|obl": 461,
+    "PROPN|Foreign=Yes|NameType=Giv|nsubj": 462,
+    "PROPN|Foreign=Yes|NameType=Oth|conj": 463,
+    "PROPN|Foreign=Yes|NameType=Oth|flat": 464,
+    "PROPN|Foreign=Yes|NameType=Oth|nmod": 465,
+    "PROPN|Foreign=Yes|NameType=Prs|conj": 466,
+    "PROPN|Foreign=Yes|NameType=Prs|flat": 467,
+    "PROPN|Foreign=Yes|NameType=Prs|nmod": 468,
+    "PROPN|Foreign=Yes|NameType=Prs|nsubj": 469,
+    "PROPN|Foreign=Yes|NameType=Prs|obj": 470,
+    "PROPN|Foreign=Yes|NameType=Prs|obl": 471,
+    "PROPN|Foreign=Yes|NameType=Sur|flat": 472,
+    "PROPN|Foreign=Yes|appos": 473,
+    "PROPN|Foreign=Yes|ccomp": 474,
+    "PROPN|Foreign=Yes|compound": 475,
+    "PROPN|Foreign=Yes|conj": 476,
+    "PROPN|Foreign=Yes|flat": 477,
+    "PROPN|Foreign=Yes|iobj": 478,
+    "PROPN|Foreign=Yes|list": 479,
+    "PROPN|Foreign=Yes|nmod": 480,
+    "PROPN|Foreign=Yes|nsubj": 481,
+    "PROPN|Foreign=Yes|obj": 482,
+    "PROPN|Foreign=Yes|obl": 483,
+    "PROPN|Foreign=Yes|root": 484,
+    "PROPN|NameType=Com|appos": 485,
+    "PROPN|NameType=Com|conj": 486,
+    "PROPN|NameType=Com|flat": 487,
+    "PROPN|NameType=Com|list": 488,
+    "PROPN|NameType=Com|nmod": 489,
+    "PROPN|NameType=Com|nsubj": 490,
+    "PROPN|NameType=Com|obj": 491,
+    "PROPN|NameType=Com|obl": 492,
+    "PROPN|NameType=Geo|compound": 493,
+    "PROPN|NameType=Geo|conj": 494,
+    "PROPN|NameType=Geo|flat": 495,
+    "PROPN|NameType=Geo|list": 496,
+    "PROPN|NameType=Geo|nmod": 497,
+    "PROPN|NameType=Geo|nsubj": 498,
+    "PROPN|NameType=Geo|nummod": 499,
+    "PROPN|NameType=Geo|obj": 500,
+    "PROPN|NameType=Geo|obl": 501,
+    "PROPN|NameType=Geo|root": 502,
+    "PROPN|NameType=Giv|acl": 503,
+    "PROPN|NameType=Giv|appos": 504,
+    "PROPN|NameType=Giv|ccomp": 505,
+    "PROPN|NameType=Giv|conj": 506,
+    "PROPN|NameType=Giv|dislocated": 507,
+    "PROPN|NameType=Giv|flat": 508,
+    "PROPN|NameType=Giv|list": 509,
+    "PROPN|NameType=Giv|nmod": 510,
+    "PROPN|NameType=Giv|nsubj": 511,
+    "PROPN|NameType=Giv|obj": 512,
+    "PROPN|NameType=Giv|obl": 513,
+    "PROPN|NameType=Giv|root": 514,
+    "PROPN|NameType=Nat|acl": 515,
+    "PROPN|NameType=Nat|appos": 516,
+    "PROPN|NameType=Nat|compound": 517,
+    "PROPN|NameType=Nat|conj": 518,
+    "PROPN|NameType=Nat|csubj": 519,
+    "PROPN|NameType=Nat|flat": 520,
+    "PROPN|NameType=Nat|list": 521,
+    "PROPN|NameType=Nat|nmod": 522,
+    "PROPN|NameType=Nat|nsubj": 523,
+    "PROPN|NameType=Nat|nummod": 524,
+    "PROPN|NameType=Nat|obj": 525,
+    "PROPN|NameType=Nat|obl": 526,
+    "PROPN|NameType=Oth|acl": 527,
+    "PROPN|NameType=Oth|appos": 528,
+    "PROPN|NameType=Oth|compound": 529,
+    "PROPN|NameType=Oth|conj": 530,
+    "PROPN|NameType=Oth|dislocated": 531,
+    "PROPN|NameType=Oth|flat": 532,
+    "PROPN|NameType=Oth|nmod": 533,
+    "PROPN|NameType=Oth|nsubj": 534,
+    "PROPN|NameType=Oth|obj": 535,
+    "PROPN|NameType=Oth|obl": 536,
+    "PROPN|NameType=Oth|root": 537,
+    "PROPN|NameType=Pro|advcl": 538,
+    "PROPN|NameType=Pro|flat": 539,
+    "PROPN|NameType=Pro|nmod": 540,
+    "PROPN|NameType=Pro|nsubj": 541,
+    "PROPN|NameType=Pro|obj": 542,
+    "PROPN|NameType=Pro|obl": 543,
+    "PROPN|NameType=Prs|conj": 544,
+    "PROPN|NameType=Prs|discourse": 545,
+    "PROPN|NameType=Prs|dislocated": 546,
+    "PROPN|NameType=Prs|flat": 547,
+    "PROPN|NameType=Prs|list": 548,
+    "PROPN|NameType=Prs|nmod": 549,
+    "PROPN|NameType=Prs|nsubj": 550,
+    "PROPN|NameType=Prs|obj": 551,
+    "PROPN|NameType=Prs|obl": 552,
+    "PROPN|NameType=Prs|vocative": 553,
+    "PROPN|NameType=Sur|flat": 554,
+    "PROPN|NameType=Sur|nmod": 555,
+    "PROPN|NameType=Sur|nsubj": 556,
+    "PROPN|NounType=Class|clf": 557,
+    "PROPN|Prefix=Yes|nmod": 558,
+    "PROPN|Prefix=Yes|nsubj": 559,
+    "PROPN|_|acl": 560,
+    "PROPN|_|acl:relcl": 561,
+    "PROPN|_|advmod": 562,
+    "PROPN|_|appos": 563,
+    "PROPN|_|aux": 564,
+    "PROPN|_|cc": 565,
+    "PROPN|_|ccomp": 566,
+    "PROPN|_|clf": 567,
+    "PROPN|_|compound": 568,
+    "PROPN|_|conj": 569,
+    "PROPN|_|fixed": 570,
+    "PROPN|_|flat": 571,
+    "PROPN|_|flat:name": 572,
+    "PROPN|_|goeswith": 573,
+    "PROPN|_|iobj": 574,
+    "PROPN|_|list": 575,
+    "PROPN|_|nmod": 576,
+    "PROPN|_|nmod:poss": 577,
+    "PROPN|_|nsubj": 578,
+    "PROPN|_|nsubj:pass": 579,
+    "PROPN|_|obj": 580,
+    "PROPN|_|obl": 581,
+    "PROPN|_|obl:poss": 582,
+    "PROPN|_|obl:tmod": 583,
+    "PROPN|_|root": 584,
+    "PROPN|_|xcomp": 585,
+    "PUNCT|NounType=Class|punct": 586,
+    "PUNCT|_|advmod": 587,
+    "PUNCT|_|clf": 588,
+    "PUNCT|_|dep": 589,
+    "PUNCT|_|punct": 590,
+    "PUNCT|_|root": 591,
+    "SCONJ|NumType=Mult|mark": 592,
+    "SCONJ|Prefix=Yes|cc": 593,
+    "SCONJ|Prefix=Yes|mark": 594,
+    "SCONJ|VerbType=Cop|mark": 595,
+    "SCONJ|_|advcl": 596,
+    "SCONJ|_|advmod": 597,
+    "SCONJ|_|case": 598,
+    "SCONJ|_|cc": 599,
+    "SCONJ|_|compound": 600,
+    "SCONJ|_|discourse": 601,
+    "SCONJ|_|fixed": 602,
+    "SCONJ|_|flat": 603,
+    "SCONJ|_|mark": 604,
+    "SCONJ|_|nsubj": 605,
+    "SCONJ|_|orphan": 606,
+    "SCONJ|_|root": 607,
+    "SYM|_|advmod": 608,
+    "SYM|_|clf": 609,
+    "SYM|_|dep": 610,
+    "SYM|_|nmod": 611,
+    "SYM|_|nsubj": 612,
+    "SYM|_|obj": 613,
+    "SYM|_|obl": 614,
+    "SYM|_|xcomp": 615,
+    "VERB|Abbr=Yes|acl": 616,
+    "VERB|Foreign=Yes|acl": 617,
+    "VERB|Foreign=Yes|advcl": 618,
+    "VERB|Foreign=Yes|ccomp": 619,
+    "VERB|Foreign=Yes|compound": 620,
+    "VERB|Foreign=Yes|conj": 621,
+    "VERB|Foreign=Yes|flat": 622,
+    "VERB|Foreign=Yes|nmod": 623,
+    "VERB|Foreign=Yes|nsubj": 624,
+    "VERB|Foreign=Yes|root": 625,
+    "VERB|Foreign=Yes|xcomp": 626,
+    "VERB|Mood=Imp|aux": 627,
+    "VERB|Mood=Imp|xcomp": 628,
+    "VERB|NounType=Class|acl": 629,
+    "VERB|NounType=Class|compound": 630,
+    "VERB|PartType=Adj|acl": 631,
+    "VERB|Prefix=Yes|acl": 632,
+    "VERB|Prefix=Yes|advcl": 633,
+    "VERB|Prefix=Yes|ccomp": 634,
+    "VERB|Prefix=Yes|compound": 635,
+    "VERB|Prefix=Yes|conj": 636,
+    "VERB|Prefix=Yes|nsubj": 637,
+    "VERB|Prefix=Yes|parataxis": 638,
+    "VERB|Prefix=Yes|root": 639,
+    "VERB|VerbType=Cop|acl": 640,
+    "VERB|VerbType=Cop|advcl": 641,
+    "VERB|VerbType=Cop|advmod": 642,
+    "VERB|VerbType=Cop|ccomp": 643,
+    "VERB|VerbType=Cop|compound": 644,
+    "VERB|VerbType=Cop|cop": 645,
+    "VERB|VerbType=Cop|parataxis": 646,
+    "VERB|VerbType=Cop|root": 647,
+    "VERB|Voice=Pass|aux": 648,
+    "VERB|Voice=Pass|aux:pass": 649,
+    "VERB|_|acl": 650,
+    "VERB|_|acl:relcl": 651,
+    "VERB|_|advcl": 652,
+    "VERB|_|advmod": 653,
+    "VERB|_|appos": 654,
+    "VERB|_|aux": 655,
+    "VERB|_|case": 656,
+    "VERB|_|cc": 657,
+    "VERB|_|ccomp": 658,
+    "VERB|_|clf": 659,
+    "VERB|_|compound": 660,
+    "VERB|_|conj": 661,
+    "VERB|_|cop": 662,
+    "VERB|_|csubj": 663,
+    "VERB|_|dep": 664,
+    "VERB|_|det": 665,
+    "VERB|_|discourse": 666,
+    "VERB|_|dislocated": 667,
+    "VERB|_|fixed": 668,
+    "VERB|_|flat": 669,
+    "VERB|_|list": 670,
+    "VERB|_|mark": 671,
+    "VERB|_|nmod": 672,
+    "VERB|_|nmod:poss": 673,
+    "VERB|_|nsubj": 674,
+    "VERB|_|obj": 675,
+    "VERB|_|obl": 676,
+    "VERB|_|obl:poss": 677,
+    "VERB|_|orphan": 678,
+    "VERB|_|parataxis": 679,
+    "VERB|_|punct": 680,
+    "VERB|_|root": 681,
+    "VERB|_|xcomp": 682,
+    "X|_|goeswith": 683
+  },
+  "layer_norm_eps": 1e-12,
+  "max_position_embeddings": 512,
+  "model_type": "camembert",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 1,
+  "position_embedding_type": "absolute",
+  "tokenizer_class": "PreTrainedTokenizerFast",
+  "torch_dtype": "float32",
+  "transformers_version": "4.44.1",
+  "type_vocab_size": 1,
+  "use_cache": true,
+  "vocab_size": 249262
+}
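Each key in the `label2id` map above packs three CoNLL-U fields into one string, `UPOS|FEATS|DEPREL`, where `FEATS` may itself contain `|`. The following is a minimal sketch of how such a label could be taken apart again; it mirrors the indexing that `ud.py` applies to the split label (`[0]`, `[1:-1]`, `[-1]`). The helper name `split_label` is hypothetical and not part of this commit.

```python
def split_label(label):
    """Split a 'UPOS|FEATS|DEPREL' label into its three parts.

    FEATS can contain '|' (e.g. 'Foreign=Yes|NounType=Class'), so only the
    first and last fields are taken by position; the middle is re-joined.
    """
    parts = label.split("|")
    upos, deprel = parts[0], parts[-1]
    feats = "|".join(parts[1:-1])
    return upos, feats, deprel

# Two labels copied from the map above.
print(split_label("NOUN|Foreign=Yes|NounType=Class|clf"))
print(split_label("X|_|goeswith"))
```

The `X|_|goeswith` label is the one the pipeline looks up by name to mark subword continuations.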
maker.py ADDED
@@ -0,0 +1,56 @@
+#! /usr/bin/python3
+src="KoichiYasuoka/camembert-thai-base-upos"
+tgt="KoichiYasuoka/camembert-thai-base-ud-goeswith"
+url="https://github.com/KoichiYasuoka/spaCy-Thai"
+import os
+d=os.path.join(os.path.basename(url),"UD_Thai-Corpora")
+os.system(f"test -d {d} || git clone --depth=1 {url}")
+s='{if(NF>0)u=u$0"\\n";else{f=FILENAME;if(u~/\\t0\\troot\\t/)print u>(f~/-dev/?"dev":f~/-test/?"test":"train")".conllu";u=""}}'
+os.system(f"nawk -F'\\t' '{s}' {d}/*-ud-*.conllu")
+class UDgoeswithDataset(object):
+  def __init__(self,conllu,tokenizer):
+    self.ids,self.tags,label=[],[],set()
+    with open(conllu,"r",encoding="utf-8") as r:
+      cls,sep,msk=tokenizer.cls_token_id,tokenizer.sep_token_id,tokenizer.mask_token_id
+      dep,c="-|_|dep",[]
+      for s in r:
+        t=s.split("\t")
+        if len(t)==10:
+          if t[0].isdecimal():
+            c.append(t)
+        elif c!=[]:
+          v=tokenizer([t[1] for t in c],add_special_tokens=False)["input_ids"]
+          for i in range(len(v)-1,-1,-1):
+            for j in range(1,len(v[i])):
+              c.insert(i+1,[c[i][0],"_","_","X","_","_",c[i][0],"goeswith","_","_"])
+          y=["0"]+[t[0] for t in c]
+          h=[i if t[6]=="0" else y.index(t[6]) for i,t in enumerate(c,1)]
+          p,v=[t[3]+"|"+t[5]+"|"+t[7] for t in c],sum(v,[])
+          self.ids.append([cls]+v+[sep])
+          self.tags.append([dep]+p+[dep])
+          label=set(sum([self.tags[-1],list(label)],[]))
+          for i,k in enumerate(v):
+            self.ids.append([cls]+v[0:i]+[msk]+v[i+1:]+[sep,k])
+            self.tags.append([dep]+[t if h[j]==i+1 else dep for j,t in enumerate(p)]+[dep,dep])
+          c=[]
+    self.label2id={l:i for i,l in enumerate(sorted(label))}
+  def __call__(*args):
+    label=set(sum([list(t.label2id) for t in args],[]))
+    lid={l:i for i,l in enumerate(sorted(label))}
+    for t in args:
+      t.label2id=lid
+    return lid
+  __len__=lambda self:len(self.ids)
+  __getitem__=lambda self,i:{"input_ids":self.ids[i],"labels":[self.label2id[t] for t in self.tags[i]]}
+from transformers import AutoTokenizer,AutoConfig,AutoModelForTokenClassification,DataCollatorForTokenClassification,TrainingArguments,Trainer
+tkz=AutoTokenizer.from_pretrained(src)
+trainDS=UDgoeswithDataset("train.conllu",tkz)
+devDS=UDgoeswithDataset("dev.conllu",tkz)
+testDS=UDgoeswithDataset("test.conllu",tkz)
+lid=trainDS(devDS,testDS)
+cfg=AutoConfig.from_pretrained(src,num_labels=len(lid),label2id=lid,id2label={i:l for l,i in lid.items()},ignore_mismatched_sizes=True)
+arg=TrainingArguments(num_train_epochs=3,per_device_train_batch_size=32,output_dir="/tmp",overwrite_output_dir=True,save_total_limit=2,evaluation_strategy="epoch",learning_rate=5e-05,warmup_ratio=0.1,save_safetensors=False)
+trn=Trainer(args=arg,data_collator=DataCollatorForTokenClassification(tkz),model=AutoModelForTokenClassification.from_pretrained(src,config=cfg,ignore_mismatched_sizes=True),train_dataset=trainDS,eval_dataset=devDS)
+trn.train()
+trn.save_model(tgt)
+tkz.save_pretrained(tgt)
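The dataset class in `maker.py` aligns words with subword tokens by inserting one extra `X` / `goeswith` row per additional subword, each pointing back at its own word. A minimal standalone sketch of that step, assuming the subword counts are already known (the real script gets them from the tokenizer); the function name `expand_goeswith`, the placeholder words `w1`/`w2`, and their annotation are hypothetical:

```python
def expand_goeswith(rows, subword_counts):
    """Insert X/goeswith rows after every word that has more than one subword.

    rows: CoNLL-U rows (lists of 10 strings) for one sentence.
    subword_counts: number of subword tokens for each row, in the same order.
    """
    out = [list(r) for r in rows]
    # Walk backwards so that earlier indices stay valid while inserting.
    for i in range(len(subword_counts) - 1, -1, -1):
        for _ in range(1, subword_counts[i]):
            wid = out[i][0]
            # The extra row reuses the word's ID and takes that ID as its head.
            out.insert(i + 1, [wid, "_", "_", "X", "_", "_", wid, "goeswith", "_", "_"])
    return out

# Hypothetical two-word sentence; pretend the second word splits into 3 subwords.
rows = [
    ["1", "w1", "_", "DET", "_", "_", "2", "det", "_", "_"],
    ["2", "w2", "_", "NOUN", "_", "_", "0", "root", "_", "_"],
]
expanded = expand_goeswith(rows, [1, 3])
labels = [r[3] + "|" + r[5] + "|" + r[7] for r in expanded]
print(labels)
```

After expansion there is exactly one label per subword token, which is what lets a plain token-classification head be trained on dependency labels.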
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3dc4c5b9ef4a4c83e720378c397595e86b73ef4c3f9ff4eeed36847afa687285
+size 1109705510
special_tokens_map.json ADDED
@@ -0,0 +1,56 @@
+{
+  "additional_special_tokens": [
+    "<s>NOTUSED",
+    "</s>NOTUSED",
+    "<_>"
+  ],
+  "bos_token": {
+    "content": "<s>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "cls_token": {
+    "content": "<s>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eos_token": {
+    "content": "</s>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "mask_token": {
+    "content": "<mask>",
+    "lstrip": true,
+    "normalized": true,
+    "rstrip": false,
+    "single_word": false
+  },
+  "pad_token": {
+    "content": "<pad>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "sep_token": {
+    "content": "</s>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "unk_token": {
+    "content": "<unk>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  }
+}
tokenizer.json ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ecc3f0ca8f095a799ca3c58a2b733176c39bd5f17cab15cd1596592644a7b3c9
+size 16917473
tokenizer_config.json ADDED
The diff for this file is too large to render. See raw diff
 
ud.py ADDED
@@ -0,0 +1,63 @@
+from transformers import TokenClassificationPipeline
+
+class UniversalDependenciesPipeline(TokenClassificationPipeline):
+  def _forward(self,model_inputs):
+    import torch
+    v=model_inputs["input_ids"][0].tolist()
+    with torch.no_grad():
+      e=self.model(input_ids=torch.tensor([v[0:i]+[self.tokenizer.mask_token_id]+v[i+1:]+[j] for i,j in enumerate(v[1:-1],1)],device=self.device))
+    return {"logits":e.logits[:,1:-2,:],**model_inputs}
+  def postprocess(self,model_outputs,**kwargs):
+    import numpy
+    if "logits" not in model_outputs:
+      return "".join(self.postprocess(x,**kwargs) for x in model_outputs)
+    e=model_outputs["logits"].numpy()
+    r=[1 if i==0 else -1 if j.endswith("|root") else 0 for i,j in sorted(self.model.config.id2label.items())]
+    e+=numpy.where(numpy.add.outer(numpy.identity(e.shape[0]),r)==0,0,numpy.nan)
+    g=self.model.config.label2id["X|_|goeswith"]
+    r=numpy.tri(e.shape[0])
+    for i in range(e.shape[0]):
+      for j in range(i+2,e.shape[1]):
+        r[i,j]=r[i,j-1] if numpy.nanargmax(e[i,j-1])==g else 1
+    e[:,:,g]+=numpy.where(r==0,0,numpy.nan)
+    m,p=numpy.nanmax(e,axis=2),numpy.nanargmax(e,axis=2)
+    h=self.chu_liu_edmonds(m)
+    z=[i for i,j in enumerate(h) if i==j]
+    if len(z)>1:
+      k,h=z[numpy.nanargmax(m[z,z])],numpy.nanmin(m)-numpy.nanmax(m)
+      m[:,z]+=[[0 if j in z and (i!=j or i==k) else h for i in z] for j in range(m.shape[0])]
+      h=self.chu_liu_edmonds(m)
+    v=[(s,e) for s,e in model_outputs["offset_mapping"][0].tolist() if s<e]
+    q=[self.model.config.id2label[p[j,i]].split("|") for i,j in enumerate(h)]
+    g="aggregation_strategy" in kwargs and kwargs["aggregation_strategy"]!="none"
+    if g:
+      for i,j in reversed(list(enumerate(q[1:],1))):
+        if j[-1]=="goeswith" and set([t[-1] for t in q[h[i]+1:i+1]])=={"goeswith"}:
+          h=[b if i>b else b-1 for a,b in enumerate(h) if i!=a]
+          v[i-1]=(v[i-1][0],v.pop(i)[1])
+          q.pop(i)
+    t=model_outputs["sentence"].replace("\n"," ")
+    u="# text = "+t+"\n"
+    for i,(s,e) in enumerate(v):
+      u+="\t".join([str(i+1),t[s:e],t[s:e] if g else "_",q[i][0],"_","|".join(q[i][1:-1]),str(0 if h[i]==i else h[i]+1),q[i][-1],"_","_" if i+1<len(v) and e<v[i+1][0] else "SpaceAfter=No"])+"\n"
+    return u+"\n"
+  def chu_liu_edmonds(self,matrix):
+    import numpy
+    h=numpy.nanargmax(matrix,axis=0)
+    x=[-1 if i==j else j for i,j in enumerate(h)]
+    for b in [lambda x,i,j:-1 if i not in x else x[i],lambda x,i,j:-1 if j<0 else x[j]]:
+      y=[]
+      while x!=y:
+        y=list(x)
+        for i,j in enumerate(x):
+          x[i]=b(x,i,j)
+      if max(x)<0:
+        return h
+    y,x=[i for i,j in enumerate(x) if j==max(x)],[i for i,j in enumerate(x) if j<max(x)]
+    z=matrix-numpy.nanmax(matrix,axis=0)
+    m=numpy.block([[z[x,:][:,x],numpy.nanmax(z[x,:][:,y],axis=1).reshape(len(x),1)],[numpy.nanmax(z[y,:][:,x],axis=0),numpy.nanmax(z[y,y])]])
+    k=[j if i==len(x) else x[j] if j<len(x) else y[numpy.nanargmax(z[y,x[i]])] for i,j in enumerate(self.chu_liu_edmonds(m))]
+    h=[j if i in y else k[x.index(i)] for i,j in enumerate(h)]
+    i=y[numpy.nanargmax(z[x[k[-1]],y] if k[-1]<len(x) else z[y,y])]
+    h[i]=x[k[-1]] if k[-1]<len(x) else i
+    return h
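Both `maker.py` (for training) and `_forward` in `ud.py` (for inference) build one input row per token: that token is replaced by the mask token, and its original id is appended after the final separator. A minimal sketch of the batch construction in plain Python, with hypothetical special-token ids (the real ones come from the tokenizer); the function name `masked_variants` is also an assumption:

```python
def masked_variants(ids, mask_id):
    """For each inner token, mask it in place and append its id after the separator.

    ids: token ids for one sentence, including the leading CLS and trailing SEP.
    Returns one row per inner token, each of length len(ids) + 1.
    """
    return [ids[:i] + [mask_id] + ids[i + 1:] + [tok]
            for i, tok in enumerate(ids[1:-1], 1)]

CLS, SEP, MASK = 5, 6, 4  # hypothetical ids, for illustration only
batch = masked_variants([CLS, 11, 12, 13, SEP], MASK)
for row in batch:
    print(row)
```

With `n` inner tokens this yields an `n`-row batch of length `n+3`, so slicing the logits with `[:,1:-2,:]`, as `_forward` does, drops the CLS position and the last two positions (SEP and the appended token) and leaves an `n` by `n` grid of label scores: row `i` holds the scores of every token when token `i` is the masked candidate head.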