sugiv commited on
Commit
df494bc
·
verified ·
1 Parent(s): 4c6363c

Upload folder using huggingface_hub

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ components/train_DIETClassifier4/DIETClassifier.tf_model.data-00000-of-00001 filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,36 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Email Processing DIET Model
2
+
3
+ DIET (Dual Intent and Entity Transformer) model trained for email processing.
4
+
5
+ ## Model Capabilities
6
+
7
+ This model can extract:
8
+ - Email addresses
9
+ - Subject lines
10
+
11
+ And classify intents such as:
12
+ - provide_email
13
+ - provide_subject
14
+ - greeting
15
+ - thank_you
16
+ - goodbye
17
+
18
+ ## Usage with Rasa
19
+
20
+ ```python
21
+ from rasa.core.agent import Agent
22
+
23
+ # Load the model
24
+ agent = Agent.load('path/to/model')
25
+
26
+ # Use the model
27
+ async def test():
28
+ result = await agent.parse_message('Email is john.doe@example.com')
29
+ print(result)
30
+ ```
31
+
32
+ ## Model Information
33
+
34
+ - Model source: nlu-20250424-041833-grave-grouse.tar.gz
35
+ - Published date: 2025-04-24
36
+ - Framework: Rasa NLU with DIET architecture
components/finetuning_validator/fingerprints-for-validation.json ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ {
2
+ "rasa-version": "3.6.21",
3
+ "fingerprint-config": "5978f89b8dc39ff9a62218d79e405651",
4
+ "fingerprint-nlu": "eef5ec8a1cd035288f5cef2f5d5dcd16"
5
+ }
components/train_CountVectorsFeaturizer3/oov_words.json ADDED
@@ -0,0 +1 @@
 
 
1
+ []
components/train_CountVectorsFeaturizer3/vocabularies.json ADDED
@@ -0,0 +1,2387 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "text": {
3
+ " ": 0,
4
+ "g": 998,
5
+ "r": 1772,
6
+ "e": 769,
7
+ "t": 2074,
8
+ "i": 1103,
9
+ "n": 1384,
10
+ "s": 1942,
11
+ " g": 84,
12
+ "gr": 1031,
13
+ "re": 1812,
14
+ "ee": 823,
15
+ "et": 931,
16
+ "ti": 2137,
17
+ "in": 1149,
18
+ "ng": 1449,
19
+ "gs": 1034,
20
+ "s ": 1943,
21
+ " gr": 87,
22
+ "gre": 1032,
23
+ "ree": 1827,
24
+ "eet": 832,
25
+ "eti": 937,
26
+ "tin": 2144,
27
+ "ing": 1165,
28
+ "ngs": 1460,
29
+ "gs ": 1035,
30
+ " gre": 88,
31
+ "gree": 1033,
32
+ "reet": 1828,
33
+ "eeti": 833,
34
+ "etin": 938,
35
+ "ting": 2145,
36
+ "ings": 1169,
37
+ "ngs ": 1461,
38
+ "h": 1041,
39
+ "l": 1251,
40
+ "o": 1528,
41
+ " h": 89,
42
+ "he": 1068,
43
+ "el": 844,
44
+ "ll": 1294,
45
+ "lo": 1301,
46
+ "o ": 1529,
47
+ " he": 94,
48
+ "hel": 1075,
49
+ "ell": 846,
50
+ "llo": 1298,
51
+ "lo ": 1302,
52
+ " hel": 96,
53
+ "hell": 1076,
54
+ "ello": 848,
55
+ "llo ": 1299,
56
+ " t": 233,
57
+ "th": 2125,
58
+ "er": 880,
59
+ "e ": 770,
60
+ " th": 239,
61
+ "the": 2132,
62
+ "her": 1078,
63
+ "ere": 889,
64
+ "re ": 1813,
65
+ " the": 241,
66
+ "ther": 2134,
67
+ "here": 1079,
68
+ "ere ": 890,
69
+ "d": 693,
70
+ "go": 1028,
71
+ "oo": 1616,
72
+ "od": 1544,
73
+ "d ": 694,
74
+ " go": 85,
75
+ "goo": 1029,
76
+ "ood": 1617,
77
+ "od ": 1545,
78
+ " goo": 86,
79
+ "good": 1030,
80
+ "ood ": 1618,
81
+ "a": 431,
82
+ "f": 966,
83
+ " a": 7,
84
+ "af": 463,
85
+ "ft": 993,
86
+ "te": 2103,
87
+ "rn": 1877,
88
+ "no": 1481,
89
+ "on": 1588,
90
+ "n ": 1385,
91
+ " af": 18,
92
+ "aft": 464,
93
+ "fte": 994,
94
+ "ter": 2117,
95
+ "ern": 893,
96
+ "rno": 1882,
97
+ "noo": 1482,
98
+ "oon": 1622,
99
+ "on ": 1589,
100
+ " aft": 19,
101
+ "afte": 465,
102
+ "fter": 995,
103
+ "tern": 2121,
104
+ "erno": 895,
105
+ "rnoo": 1883,
106
+ "noon": 1483,
107
+ "oon ": 1623,
108
+ "hi": 1082,
109
+ "i ": 1104,
110
+ " hi": 98,
111
+ "hi ": 1083,
112
+ " hi ": 99,
113
+ "v": 2272,
114
+ "y": 2350,
115
+ " e": 64,
116
+ "ev": 941,
117
+ "ve": 2281,
118
+ "ry": 1934,
119
+ "yo": 2368,
120
+ "ne": 1427,
121
+ " ev": 70,
122
+ "eve": 943,
123
+ "ver": 2285,
124
+ "ery": 907,
125
+ "ryo": 1940,
126
+ "yon": 2369,
127
+ "one": 1601,
128
+ "ne ": 1428,
129
+ " eve": 71,
130
+ "ever": 945,
131
+ "very": 2288,
132
+ "eryo": 908,
133
+ "ryon": 1941,
134
+ "yone": 2370,
135
+ "one ": 1602,
136
+ "w": 2307,
137
+ " f": 74,
138
+ "fa": 968,
139
+ "ar": 526,
140
+ "ew": 949,
141
+ "we": 2318,
142
+ "l ": 1252,
143
+ " fa": 75,
144
+ "far": 969,
145
+ "are": 534,
146
+ "rew": 1847,
147
+ "ewe": 953,
148
+ "wel": 2321,
149
+ "ll ": 1295,
150
+ " far": 76,
151
+ "fare": 970,
152
+ "arew": 537,
153
+ "rewe": 1848,
154
+ "ewel": 954,
155
+ "well": 2322,
156
+ "ell ": 847,
157
+ " s": 206,
158
+ "se": 1973,
159
+ " se": 213,
160
+ "see": 1982,
161
+ "ee ": 824,
162
+ " see": 215,
163
+ "see ": 1983,
164
+ "u": 2189,
165
+ " y": 275,
166
+ "ou": 1662,
167
+ "u ": 2190,
168
+ " yo": 278,
169
+ "you": 2371,
170
+ "ou ": 1663,
171
+ " you": 279,
172
+ "you ": 2372,
173
+ "so": 2017,
174
+ " so": 221,
175
+ "soo": 2026,
176
+ " soo": 223,
177
+ "soon": 2027,
178
+ " i": 103,
179
+ " i ": 104,
180
+ "m": 1320,
181
+ " m": 132,
182
+ "m ": 1321,
183
+ " m ": 133,
184
+ " l": 124,
185
+ "le": 1269,
186
+ "ea": 789,
187
+ "av": 571,
188
+ "vi": 2289,
189
+ "g ": 999,
190
+ " le": 128,
191
+ "lea": 1273,
192
+ "eav": 802,
193
+ "avi": 574,
194
+ "vin": 2296,
195
+ "ng ": 1450,
196
+ " lea": 129,
197
+ "leav": 1275,
198
+ "eavi": 803,
199
+ "avin": 576,
200
+ "ving": 2297,
201
+ "ing ": 1166,
202
+ " n": 151,
203
+ "ow": 1677,
204
+ "w ": 2308,
205
+ " no": 157,
206
+ "now": 1486,
207
+ "ow ": 1678,
208
+ " now": 159,
209
+ "now ": 1487,
210
+ " u": 251,
211
+ "un": 2234,
212
+ "nt": 1499,
213
+ "il": 1134,
214
+ " un": 252,
215
+ "unt": 2240,
216
+ "nti": 1510,
217
+ "til": 2140,
218
+ "il ": 1135,
219
+ " unt": 253,
220
+ "unti": 2241,
221
+ "ntil": 1511,
222
+ "til ": 2141,
223
+ "x": 2338,
224
+ "ex": 957,
225
+ "xt": 2348,
226
+ "t ": 2075,
227
+ " ne": 152,
228
+ "nex": 1441,
229
+ "ext": 962,
230
+ "xt ": 2349,
231
+ " nex": 154,
232
+ "next": 1442,
233
+ "ext ": 963,
234
+ "im": 1143,
235
+ "me": 1343,
236
+ " ti": 242,
237
+ "tim": 2142,
238
+ "ime": 1145,
239
+ "me ": 1344,
240
+ " tim": 243,
241
+ "time": 2143,
242
+ "ime ": 1146,
243
+ "k": 1240,
244
+ "ta": 2086,
245
+ "ak": 480,
246
+ "ke": 1245,
247
+ " ta": 234,
248
+ "tak": 2093,
249
+ "ake": 481,
250
+ "ke ": 1246,
251
+ " tak": 236,
252
+ "take": 2094,
253
+ "ake ": 482,
254
+ "c": 619,
255
+ " c": 40,
256
+ "ca": 621,
257
+ " ca": 41,
258
+ "car": 624,
259
+ " car": 43,
260
+ "care": 625,
261
+ "are ": 535,
262
+ "mu": 1377,
263
+ "uc": 2199,
264
+ "ch": 643,
265
+ "h ": 1042,
266
+ " mu": 147,
267
+ "muc": 1378,
268
+ "uch": 2200,
269
+ "ch ": 644,
270
+ " muc": 148,
271
+ "much": 1379,
272
+ "uch ": 2201,
273
+ "p": 1683,
274
+ "ap": 521,
275
+ "pp": 1730,
276
+ "pr": 1739,
277
+ "ec": 804,
278
+ "ci": 652,
279
+ "ia": 1105,
280
+ "at": 555,
281
+ "ed": 816,
282
+ " ap": 23,
283
+ "app": 522,
284
+ "ppr": 1736,
285
+ "pre": 1740,
286
+ "rec": 1821,
287
+ "eci": 807,
288
+ "cia": 653,
289
+ "iat": 1106,
290
+ "ate": 557,
291
+ "ted": 2109,
292
+ "ed ": 817,
293
+ " app": 24,
294
+ "appr": 525,
295
+ "ppre": 1737,
296
+ "prec": 1741,
297
+ "reci": 1823,
298
+ "ecia": 808,
299
+ "ciat": 654,
300
+ "iate": 1107,
301
+ "ated": 560,
302
+ "ted ": 2110,
303
+ "ma": 1325,
304
+ "an": 500,
305
+ "ny": 1523,
306
+ "y ": 2351,
307
+ " ma": 134,
308
+ "man": 1331,
309
+ "any": 518,
310
+ "ny ": 1524,
311
+ " man": 136,
312
+ "many": 1332,
313
+ "any ": 519,
314
+ "ha": 1052,
315
+ "nk": 1474,
316
+ "ks": 1249,
317
+ "tha": 2129,
318
+ "han": 1053,
319
+ "ank": 511,
320
+ "nks": 1476,
321
+ "ks ": 1250,
322
+ " tha": 240,
323
+ "than": 2130,
324
+ "hank": 1056,
325
+ "anks": 513,
326
+ "nks ": 1477,
327
+ "b": 579,
328
+ "ab": 433,
329
+ "bs": 604,
330
+ "ol": 1569,
331
+ "lu": 1315,
332
+ "ut": 2268,
333
+ "ly": 1318,
334
+ " ab": 9,
335
+ "abs": 438,
336
+ "bso": 607,
337
+ "sol": 2020,
338
+ "olu": 1573,
339
+ "lut": 1316,
340
+ "ute": 2270,
341
+ "tel": 2113,
342
+ "ely": 852,
343
+ "ly ": 1319,
344
+ " abs": 11,
345
+ "abso": 439,
346
+ "bsol": 608,
347
+ "solu": 2021,
348
+ "olut": 1574,
349
+ "lute": 1317,
350
+ "utel": 2271,
351
+ "tely": 2114,
352
+ "ely ": 853,
353
+ "hat": 1061,
354
+ "at ": 556,
355
+ "that": 2131,
356
+ "hat ": 1062,
357
+ "is": 1197,
358
+ " is": 112,
359
+ "is ": 1198,
360
+ " is ": 113,
361
+ "co": 660,
362
+ "or": 1633,
363
+ "rr": 1900,
364
+ "ct": 679,
365
+ " co": 48,
366
+ "cor": 674,
367
+ "orr": 1648,
368
+ "rre": 1901,
369
+ "ect": 811,
370
+ "ct ": 680,
371
+ " cor": 51,
372
+ "corr": 675,
373
+ "orre": 1649,
374
+ "rrec": 1902,
375
+ "rect": 1824,
376
+ "ect ": 812,
377
+ "nf": 1443,
378
+ "fi": 976,
379
+ "ir": 1188,
380
+ "rm": 1873,
381
+ "con": 668,
382
+ "onf": 1603,
383
+ "nfi": 1444,
384
+ "fir": 983,
385
+ "irm": 1193,
386
+ "rm ": 1874,
387
+ " con": 50,
388
+ "conf": 671,
389
+ "onfi": 1604,
390
+ "nfir": 1445,
391
+ "firm": 984,
392
+ "irm ": 1194,
393
+ "nd": 1415,
394
+ "de": 723,
395
+ " in": 107,
396
+ "ind": 1157,
397
+ "nde": 1421,
398
+ "dee": 724,
399
+ "eed": 825,
400
+ " ind": 108,
401
+ "inde": 1158,
402
+ "ndee": 1422,
403
+ "deed": 725,
404
+ "eed ": 826,
405
+ "he ": 1069,
406
+ "the ": 2133,
407
+ "em": 854,
408
+ "ai": 473,
409
+ " em": 65,
410
+ "ema": 856,
411
+ "mai": 1326,
412
+ "ail": 474,
413
+ " ema": 66,
414
+ "emai": 857,
415
+ "mail": 1327,
416
+ "ail ": 475,
417
+ "ad": 454,
418
+ "dd": 719,
419
+ "dr": 758,
420
+ "es": 909,
421
+ "ss": 2033,
422
+ " ad": 15,
423
+ "add": 456,
424
+ "ddr": 721,
425
+ "dre": 759,
426
+ "res": 1838,
427
+ "ess": 921,
428
+ "ss ": 2034,
429
+ " add": 16,
430
+ "addr": 458,
431
+ "ddre": 722,
432
+ "dres": 760,
433
+ "ress": 1842,
434
+ "ess ": 922,
435
+ ".": 287,
436
+ "@": 360,
437
+ "w.": 2312,
438
+ ".u": 328,
439
+ "us": 2257,
440
+ "r@": 1777,
441
+ "@d": 373,
442
+ "do": 751,
443
+ "om": 1575,
444
+ "n.": 1386,
445
+ ".o": 314,
446
+ "rg": 1849,
447
+ "new": 1437,
448
+ "ew.": 951,
449
+ "w.u": 2313,
450
+ ".us": 331,
451
+ "use": 2259,
452
+ "ser": 1986,
453
+ "er@": 884,
454
+ "r@d": 1780,
455
+ "@do": 376,
456
+ "dom": 754,
457
+ "oma": 1577,
458
+ "ain": 477,
459
+ "in.": 1150,
460
+ "n.o": 1393,
461
+ ".or": 315,
462
+ "org": 1639,
463
+ "rg ": 1850,
464
+ " new": 153,
465
+ "new.": 1439,
466
+ "ew.u": 952,
467
+ "w.us": 2314,
468
+ ".use": 332,
469
+ "user": 2261,
470
+ "ser@": 1988,
471
+ "er@d": 885,
472
+ "r@do": 1781,
473
+ "@dom": 377,
474
+ "doma": 755,
475
+ "omai": 1578,
476
+ "main": 1328,
477
+ "ain.": 478,
478
+ "in.o": 1152,
479
+ "n.or": 1394,
480
+ ".org": 316,
481
+ "org ": 1640,
482
+ " p": 171,
483
+ "pl": 1711,
484
+ "as": 546,
485
+ " pl": 176,
486
+ "ple": 1712,
487
+ "eas": 800,
488
+ "ase": 548,
489
+ "se ": 1974,
490
+ " ple": 177,
491
+ "plea": 1714,
492
+ "leas": 1274,
493
+ "ease": 801,
494
+ "ase ": 549,
495
+ " r": 193,
496
+ "sp": 2030,
497
+ "po": 1722,
498
+ " re": 194,
499
+ "esp": 919,
500
+ "spo": 2031,
501
+ "pon": 1723,
502
+ "ond": 1598,
503
+ "nd ": 1416,
504
+ " res": 200,
505
+ "resp": 1841,
506
+ "espo": 920,
507
+ "spon": 2032,
508
+ "pond": 1724,
509
+ "ond ": 1599,
510
+ "to": 2158,
511
+ " to": 245,
512
+ "to ": 2159,
513
+ " to ": 246,
514
+ "cl": 657,
515
+ "li": 1286,
516
+ "ie": 1119,
517
+ "en": 858,
518
+ "t@": 2081,
519
+ "@b": 364,
520
+ "bu": 609,
521
+ "si": 1999,
522
+ "s.": 1944,
523
+ ".n": 311,
524
+ " cl": 46,
525
+ "cli": 658,
526
+ "lie": 1289,
527
+ "ien": 1120,
528
+ "ent": 864,
529
+ "nt@": 1501,
530
+ "t@b": 2082,
531
+ "@bu": 367,
532
+ "bus": 614,
533
+ "usi": 2262,
534
+ "sin": 2000,
535
+ "ine": 1159,
536
+ "nes": 1433,
537
+ "ss.": 2035,
538
+ "s.n": 1949,
539
+ ".ne": 312,
540
+ "net": 1435,
541
+ "et ": 932,
542
+ " cli": 47,
543
+ "clie": 659,
544
+ "lien": 1290,
545
+ "ient": 1121,
546
+ "ent@": 866,
547
+ "nt@b": 1502,
548
+ "t@bu": 2083,
549
+ "@bus": 368,
550
+ "busi": 615,
551
+ "usin": 2263,
552
+ "sine": 2001,
553
+ "ines": 1162,
554
+ "ness": 1434,
555
+ "ess.": 923,
556
+ "ss.n": 2037,
557
+ "s.ne": 1950,
558
+ ".net": 313,
559
+ "net ": 1436,
560
+ "can": 622,
561
+ "an ": 501,
562
+ " can": 42,
563
+ "can ": 623,
564
+ " b": 30,
565
+ "be": 584,
566
+ " be": 31,
567
+ "be ": 585,
568
+ " be ": 32,
569
+ "ac": 440,
570
+ "rea": 1818,
571
+ "eac": 790,
572
+ "ach": 445,
573
+ "che": 650,
574
+ "hed": 1072,
575
+ " rea": 195,
576
+ "reac": 1819,
577
+ "each": 791,
578
+ "ache": 447,
579
+ "ched": 651,
580
+ "hed ": 1073,
581
+ " at": 28,
582
+ " at ": 29,
583
+ "1": 337,
584
+ "2": 341,
585
+ "3": 353,
586
+ "pe": 1702,
587
+ "rs": 1905,
588
+ "n1": 1397,
589
+ "12": 338,
590
+ "23": 346,
591
+ "3@": 355,
592
+ "@m": 390,
593
+ "l.": 1253,
594
+ ".c": 291,
595
+ " pe": 172,
596
+ "per": 1703,
597
+ "ers": 897,
598
+ "rso": 1911,
599
+ "son": 2022,
600
+ "on1": 1592,
601
+ "n12": 1398,
602
+ "123": 339,
603
+ "23@": 347,
604
+ "3@m": 358,
605
+ "@ma": 391,
606
+ "il.": 1136,
607
+ "l.c": 1254,
608
+ ".co": 294,
609
+ "co ": 661,
610
+ " per": 173,
611
+ "pers": 1704,
612
+ "erso": 901,
613
+ "rson": 1912,
614
+ "son1": 2023,
615
+ "on12": 1593,
616
+ "n123": 1399,
617
+ "123@": 340,
618
+ "23@m": 349,
619
+ "3@ma": 359,
620
+ "@mai": 392,
621
+ "ail.": 476,
622
+ "il.c": 1137,
623
+ "l.co": 1255,
624
+ ".co ": 295,
625
+ "lp": 1305,
626
+ "p@": 1688,
627
+ "@s": 404,
628
+ "su": 2060,
629
+ "up": 2242,
630
+ "rt": 1915,
631
+ "t.": 2076,
632
+ "ce": 631,
633
+ "r ": 1773,
634
+ "elp": 849,
635
+ "lp@": 1306,
636
+ "p@s": 1689,
637
+ "@su": 410,
638
+ "sup": 2066,
639
+ "upp": 2246,
640
+ "ppo": 1734,
641
+ "por": 1726,
642
+ "ort": 1651,
643
+ "rt.": 1917,
644
+ "t.c": 2077,
645
+ ".ce": 292,
646
+ "cen": 636,
647
+ "nte": 1506,
648
+ "er ": 881,
649
+ "help": 1077,
650
+ "elp@": 850,
651
+ "lp@s": 1307,
652
+ "p@su": 1691,
653
+ "@sup": 411,
654
+ "supp": 2067,
655
+ "uppo": 2247,
656
+ "ppor": 1735,
657
+ "port": 1727,
658
+ "ort.": 1653,
659
+ "rt.c": 1918,
660
+ "t.ce": 2078,
661
+ ".cen": 293,
662
+ "cent": 637,
663
+ "ente": 868,
664
+ "nter": 1507,
665
+ "ter ": 2118,
666
+ " w": 263,
667
+ "wh": 2323,
668
+ " wh": 266,
669
+ "whe": 2324,
670
+ " whe": 267,
671
+ "wher": 2325,
672
+ "sh": 1993,
673
+ "ho": 1093,
674
+ "ul": 2221,
675
+ "ld": 1265,
676
+ " sh": 219,
677
+ "sho": 1996,
678
+ "hou": 1096,
679
+ "oul": 1664,
680
+ "uld": 2222,
681
+ "ld ": 1266,
682
+ " sho": 220,
683
+ "shou": 1998,
684
+ "houl": 1097,
685
+ "ould": 1665,
686
+ "uld ": 2223,
687
+ "wr": 2332,
688
+ "ri": 1855,
689
+ "it": 1204,
690
+ " wr": 270,
691
+ "wri": 2333,
692
+ "rit": 1862,
693
+ "ite": 1208,
694
+ "te ": 2104,
695
+ " wri": 271,
696
+ "writ": 2334,
697
+ "rite": 1863,
698
+ "ite ": 1209,
699
+ "sen": 1984,
700
+ "end": 859,
701
+ " sen": 216,
702
+ "send": 1985,
703
+ "end ": 860,
704
+ "ur": 2248,
705
+ "our": 1668,
706
+ "ur ": 2249,
707
+ "your": 2373,
708
+ "our ": 1669,
709
+ "ep": 869,
710
+ "rep": 1833,
711
+ "epl": 872,
712
+ "ply": 1718,
713
+ " rep": 198,
714
+ "repl": 1834,
715
+ "eply": 873,
716
+ "ply ": 1719,
717
+ "ns": 1491,
718
+ "e@": 780,
719
+ "@c": 369,
720
+ "mp": 1370,
721
+ "pa": 1692,
722
+ "y.": 2352,
723
+ "ons": 1607,
724
+ "nse": 1495,
725
+ "se@": 1975,
726
+ "e@c": 783,
727
+ "@co": 370,
728
+ "com": 664,
729
+ "omp": 1585,
730
+ "mpa": 1371,
731
+ "pan": 1693,
732
+ "ny.": 1525,
733
+ "y.c": 2353,
734
+ "om ": 1576,
735
+ "pons": 1725,
736
+ "onse": 1610,
737
+ "nse@": 1496,
738
+ "se@c": 1976,
739
+ "e@co": 784,
740
+ "@com": 371,
741
+ "comp": 667,
742
+ "ompa": 1586,
743
+ "mpan": 1372,
744
+ "pany": 1694,
745
+ "any.": 520,
746
+ "ny.c": 1526,
747
+ "y.co": 2354,
748
+ ".com": 297,
749
+ "com ": 665,
750
+ "my": 1382,
751
+ " my": 149,
752
+ "my ": 1383,
753
+ " my ": 150,
754
+ "al": 483,
755
+ "lt": 1310,
756
+ "na": 1403,
757
+ " al": 20,
758
+ "alt": 492,
759
+ "lte": 1311,
760
+ "rna": 1878,
761
+ "nat": 1408,
762
+ " alt": 22,
763
+ "alte": 493,
764
+ "lter": 1312,
765
+ "erna": 894,
766
+ "rnat": 1879,
767
+ "nate": 1409,
768
+ "ate ": 558,
769
+ "da": 703,
770
+ "y@": 2361,
771
+ "@p": 396,
772
+ ".m": 306,
773
+ "sec": 1979,
774
+ "eco": 809,
775
+ "nda": 1419,
776
+ "dar": 704,
777
+ "ary": 543,
778
+ "ry@": 1938,
779
+ "y@p": 2362,
780
+ "@pe": 397,
781
+ "ona": 1594,
782
+ "nal": 1404,
783
+ "al.": 485,
784
+ "l.m": 1256,
785
+ ".me": 309,
786
+ " sec": 214,
787
+ "seco": 1980,
788
+ "econ": 810,
789
+ "cond": 670,
790
+ "onda": 1600,
791
+ "ndar": 1420,
792
+ "dary": 705,
793
+ "ary@": 545,
794
+ "ry@p": 1939,
795
+ "y@pe": 2363,
796
+ "@per": 398,
797
+ "sona": 2024,
798
+ "onal": 1595,
799
+ "nal.": 1405,
800
+ "al.m": 486,
801
+ "l.me": 1257,
802
+ ".me ": 310,
803
+ " o": 160,
804
+ "rd": 1805,
805
+ "s@": 1951,
806
+ "op": 1624,
807
+ "p.": 1685,
808
+ ".s": 320,
809
+ "st": 2045,
810
+ " or": 167,
811
+ "ord": 1635,
812
+ "rde": 1808,
813
+ "der": 730,
814
+ "rs@": 1907,
815
+ "s@s": 1956,
816
+ "@sh": 408,
817
+ "hop": 1094,
818
+ "op.": 1625,
819
+ "p.s": 1686,
820
+ ".st": 323,
821
+ "sto": 2055,
822
+ "tor": 2167,
823
+ "ore": 1637,
824
+ " ord": 168,
825
+ "orde": 1636,
826
+ "rder": 1809,
827
+ "ders": 732,
828
+ "ers@": 899,
829
+ "rs@s": 1908,
830
+ "s@sh": 1957,
831
+ "@sho": 409,
832
+ "shop": 1997,
833
+ "hop.": 1095,
834
+ "op.s": 1626,
835
+ "p.st": 1687,
836
+ ".sto": 324,
837
+ "stor": 2057,
838
+ "tore": 2168,
839
+ "ore ": 1638,
840
+ "ro": 1886,
841
+ "oc": 1541,
842
+ " pr": 178,
843
+ "pro": 1746,
844
+ "roc": 1888,
845
+ "oce": 1542,
846
+ "ces": 640,
847
+ "sse": 2040,
848
+ "ses": 1990,
849
+ "es ": 910,
850
+ " pro": 181,
851
+ "proc": 1748,
852
+ "roce": 1889,
853
+ "oces": 1543,
854
+ "cess": 642,
855
+ "esse": 925,
856
+ "sses": 2041,
857
+ "ses ": 1991,
858
+ "all": 490,
859
+ " all": 21,
860
+ "all ": 491,
861
+ "pu": 1756,
862
+ "rc": 1800,
863
+ " pu": 182,
864
+ "pur": 1757,
865
+ "urc": 2250,
866
+ "rch": 1801,
867
+ "cha": 647,
868
+ "has": 1058,
869
+ " pur": 183,
870
+ "purc": 1758,
871
+ "urch": 2251,
872
+ "rcha": 1804,
873
+ "chas": 649,
874
+ "hase": 1060,
875
+ "ases": 550,
876
+ "hr": 1098,
877
+ " hr": 101,
878
+ "hr@": 1099,
879
+ "r@c": 1778,
880
+ "y.o": 2357,
881
+ " hr@": 102,
882
+ "hr@c": 1100,
883
+ "r@co": 1779,
884
+ "ny.o": 1527,
885
+ "y.or": 2358,
886
+ "dl": 745,
887
+ " ha": 90,
888
+ "and": 504,
889
+ "ndl": 1423,
890
+ "dle": 746,
891
+ "les": 1280,
892
+ " han": 91,
893
+ "hand": 1054,
894
+ "andl": 506,
895
+ "ndle": 1424,
896
+ "dles": 747,
897
+ "les ": 1281,
898
+ "nn": 1478,
899
+ "onn": 1605,
900
+ "nne": 1479,
901
+ "nel": 1431,
902
+ "el ": 845,
903
+ "sonn": 2025,
904
+ "onne": 1606,
905
+ "nnel": 1480,
906
+ "nel ": 1432,
907
+ "tt": 2177,
908
+ "mat": 1337,
909
+ "att": 567,
910
+ "tte": 2178,
911
+ "rs ": 1906,
912
+ " mat": 138,
913
+ "matt": 1339,
914
+ "atte": 568,
915
+ "tter": 2179,
916
+ "ters": 2122,
917
+ "ers ": 898,
918
+ "tl": 2152,
919
+ "tit": 2148,
920
+ "itl": 1212,
921
+ "tle": 2153,
922
+ "le ": 1270,
923
+ " tit": 244,
924
+ "titl": 2149,
925
+ "itle": 1213,
926
+ "tle ": 2154,
927
+ " it": 114,
928
+ "it ": 1205,
929
+ " it ": 115,
930
+ "q": 1759,
931
+ "eq": 876,
932
+ "qu": 1764,
933
+ "ue": 2207,
934
+ "req": 1836,
935
+ "equ": 877,
936
+ "que": 1767,
937
+ "ues": 2208,
938
+ "est": 927,
939
+ "st ": 2046,
940
+ " req": 199,
941
+ "requ": 1837,
942
+ "eque": 878,
943
+ "ques": 1768,
944
+ "uest": 2209,
945
+ "est ": 928,
946
+ "fo": 985,
947
+ " fo": 81,
948
+ "for": 990,
949
+ "or ": 1634,
950
+ " for": 83,
951
+ "for ": 991,
952
+ "nc": 1410,
953
+ " as": 25,
954
+ "ass": 553,
955
+ "ssi": 2042,
956
+ "sis": 2004,
957
+ "ist": 1201,
958
+ "sta": 2047,
959
+ "tan": 2097,
960
+ "anc": 502,
961
+ "nce": 1411,
962
+ "ce ": 632,
963
+ " ass": 27,
964
+ "assi": 554,
965
+ "ssis": 2044,
966
+ "sist": 2005,
967
+ "ista": 1202,
968
+ "stan": 2048,
969
+ "tanc": 2098,
970
+ "ance": 503,
971
+ "nce ": 1412,
972
+ "j": 1225,
973
+ "ub": 2194,
974
+ "bj": 593,
975
+ "je": 1229,
976
+ " su": 228,
977
+ "sub": 2061,
978
+ "ubj": 2195,
979
+ "bje": 594,
980
+ "jec": 1230,
981
+ " sub": 229,
982
+ "subj": 2062,
983
+ "ubje": 2196,
984
+ "bjec": 595,
985
+ "ject": 1231,
986
+ "of": 1556,
987
+ "f ": 967,
988
+ " of": 161,
989
+ "of ": 1557,
990
+ " of ": 162,
991
+ "du": 763,
992
+ "rod": 1890,
993
+ "odu": 1550,
994
+ "duc": 765,
995
+ "uct": 2202,
996
+ "prod": 1749,
997
+ "rodu": 1891,
998
+ "oduc": 1551,
999
+ "duct": 766,
1000
+ "uct ": 2203,
1001
+ "nq": 1488,
1002
+ "ui": 2210,
1003
+ "inq": 1173,
1004
+ "nqu": 1489,
1005
+ "qui": 1769,
1006
+ "uir": 2213,
1007
+ "iry": 1195,
1008
+ "ry ": 1935,
1009
+ " inq": 110,
1010
+ "inqu": 1174,
1011
+ "nqui": 1490,
1012
+ "quir": 1770,
1013
+ "uiry": 2216,
1014
+ "iry ": 1196,
1015
+ "mo": 1362,
1016
+ " mo": 143,
1017
+ "mod": 1363,
1018
+ "ode": 1548,
1019
+ "del": 726,
1020
+ " mod": 144,
1021
+ "mode": 1364,
1022
+ "odel": 1549,
1023
+ "del ": 727,
1024
+ "0": 333,
1025
+ " x": 272,
1026
+ "x2": 2339,
1027
+ "20": 343,
1028
+ "00": 335,
1029
+ "0 ": 334,
1030
+ " x2": 273,
1031
+ "x20": 2340,
1032
+ "200": 344,
1033
+ "00 ": 336,
1034
+ " x20": 274,
1035
+ "x200": 2341,
1036
+ "200 ": 345,
1037
+ " us": 258,
1038
+ " use": 259,
1039
+ "use ": 2260,
1040
+ "ic": 1108,
1041
+ "io": 1178,
1042
+ "ppl": 1732,
1043
+ "pli": 1716,
1044
+ "lic": 1287,
1045
+ "ica": 1110,
1046
+ "cat": 626,
1047
+ "ati": 564,
1048
+ "tio": 2146,
1049
+ "ion": 1179,
1050
+ "appl": 524,
1051
+ "ppli": 1733,
1052
+ "plic": 1717,
1053
+ "lica": 1288,
1054
+ "icat": 1111,
1055
+ "cati": 627,
1056
+ "atio": 565,
1057
+ "tion": 2147,
1058
+ "ion ": 1180,
1059
+ "tu": 2180,
1060
+ " st": 225,
1061
+ "tat": 2100,
1062
+ "atu": 569,
1063
+ "tus": 2181,
1064
+ "us ": 2258,
1065
+ " sta": 226,
1066
+ "stat": 2049,
1067
+ "tatu": 2102,
1068
+ "atus": 570,
1069
+ "tus ": 2182,
1070
+ "pd": 1697,
1071
+ " up": 254,
1072
+ "upd": 2244,
1073
+ "pda": 1698,
1074
+ "dat": 708,
1075
+ " upd": 255,
1076
+ "upda": 2245,
1077
+ "pdat": 1699,
1078
+ "date": 709,
1079
+ " li": 130,
1080
+ "lin": 1291,
1081
+ " lin": 131,
1082
+ "line": 1292,
1083
+ "ine ": 1160,
1084
+ "hl": 1086,
1085
+ "mon": 1365,
1086
+ "ont": 1612,
1087
+ "nth": 1508,
1088
+ "thl": 2135,
1089
+ "hly": 1087,
1090
+ " mon": 145,
1091
+ "mont": 1366,
1092
+ "onth": 1614,
1093
+ "nthl": 1509,
1094
+ "thly": 2136,
1095
+ "hly ": 1088,
1096
+ "epo": 874,
1097
+ "rt ": 1916,
1098
+ "repo": 1835,
1099
+ "epor": 875,
1100
+ "ort ": 1652,
1101
+ "mar": 1333,
1102
+ "arc": 529,
1103
+ " mar": 137,
1104
+ "marc": 1334,
1105
+ "arch": 530,
1106
+ "rch ": 1802,
1107
+ "ig": 1129,
1108
+ "gu": 1036,
1109
+ " fi": 79,
1110
+ "fig": 979,
1111
+ "igu": 1132,
1112
+ "gur": 1037,
1113
+ "ure": 2252,
1114
+ " fig": 80,
1115
+ "figu": 980,
1116
+ "igur": 1133,
1117
+ "gure": 1038,
1118
+ "ures": 2254,
1119
+ "res ": 1839,
1120
+ "mak": 1329,
1121
+ " mak": 135,
1122
+ "make": 1330,
1123
+ "fe": 971,
1124
+ "db": 714,
1125
+ "ba": 581,
1126
+ "ck": 655,
1127
+ "k ": 1241,
1128
+ " fe": 77,
1129
+ "fee": 972,
1130
+ "edb": 818,
1131
+ "dba": 715,
1132
+ "bac": 582,
1133
+ "ack": 448,
1134
+ "ck ": 656,
1135
+ " fee": 78,
1136
+ "feed": 973,
1137
+ "eedb": 827,
1138
+ "edba": 819,
1139
+ "dbac": 716,
1140
+ "back": 583,
1141
+ "ack ": 449,
1142
+ " on": 163,
1143
+ " on ": 164,
1144
+ "ece": 805,
1145
+ "nt ": 1500,
1146
+ " rec": 196,
1147
+ "rece": 1822,
1148
+ "ecen": 806,
1149
+ "ent ": 865,
1150
+ "ge": 1011,
1151
+ " ch": 44,
1152
+ "ang": 507,
1153
+ "nge": 1455,
1154
+ "ges": 1015,
1155
+ " cha": 45,
1156
+ "chan": 648,
1157
+ "hang": 1055,
1158
+ "ange": 508,
1159
+ "nges": 1457,
1160
+ "ges ": 1016,
1161
+ "onc": 1596,
1162
+ "cer": 638,
1163
+ "rns": 1884,
1164
+ "ns ": 1492,
1165
+ "conc": 669,
1166
+ "once": 1597,
1167
+ "ncer": 1414,
1168
+ "cern": 639,
1169
+ "erns": 896,
1170
+ "rns ": 1885,
1171
+ "sc": 1966,
1172
+ " sc": 211,
1173
+ "sch": 1967,
1174
+ "edu": 820,
1175
+ "dul": 767,
1176
+ "ule": 2224,
1177
+ " sch": 212,
1178
+ "sche": 1968,
1179
+ "hedu": 1074,
1180
+ "edul": 822,
1181
+ "dule": 768,
1182
+ "ule ": 2225,
1183
+ "ge ": 1012,
1184
+ "nge ": 1456,
1185
+ " v": 260,
1186
+ "va": 2274,
1187
+ " va": 261,
1188
+ "vac": 2275,
1189
+ "aca": 441,
1190
+ " vac": 262,
1191
+ "vaca": 2276,
1192
+ "acat": 442,
1193
+ " j": 116,
1194
+ "ju": 1237,
1195
+ " ju": 122,
1196
+ "jul": 1238,
1197
+ "uly": 2229,
1198
+ " jul": 123,
1199
+ "july": 1239,
1200
+ "uly ": 2230,
1201
+ "_": 417,
1202
+ "-": 280,
1203
+ " _": 4,
1204
+ "__": 422,
1205
+ "_n": 428,
1206
+ "nu": 1515,
1207
+ "um": 2231,
1208
+ "mb": 1340,
1209
+ "r_": 1784,
1210
+ "_-": 419,
1211
+ "-_": 281,
1212
+ "_ ": 418,
1213
+ " __": 5,
1214
+ "__n": 426,
1215
+ "_nu": 429,
1216
+ "num": 1516,
1217
+ "umb": 2232,
1218
+ "mbe": 1341,
1219
+ "ber": 588,
1220
+ "er_": 887,
1221
+ "r__": 1785,
1222
+ "__-": 424,
1223
+ "_-_": 420,
1224
+ "-__": 282,
1225
+ "__ ": 423,
1226
+ " __n": 6,
1227
+ "__nu": 427,
1228
+ "_num": 430,
1229
+ "numb": 1517,
1230
+ "umbe": 2233,
1231
+ "mber": 1342,
1232
+ "ber_": 589,
1233
+ "er__": 888,
1234
+ "r__-": 1787,
1235
+ "__-_": 425,
1236
+ "_-__": 421,
1237
+ "-__n": 283,
1238
+ "r__ ": 1786,
1239
+ "dd ": 720,
1240
+ "add ": 457,
1241
+ " q": 184,
1242
+ " qu": 189,
1243
+ "sti": 2053,
1244
+ " que": 191,
1245
+ "esti": 930,
1246
+ "stio": 2054,
1247
+ "bo": 596,
1248
+ "abo": 436,
1249
+ "bou": 599,
1250
+ "out": 1672,
1251
+ "ut ": 2269,
1252
+ " abo": 10,
1253
+ "abou": 437,
1254
+ "bout": 600,
1255
+ "out ": 1673,
1256
+ "rv": 1929,
1257
+ "erv": 904,
1258
+ "rvi": 1932,
1259
+ "vic": 2290,
1260
+ "ice": 1112,
1261
+ " ser": 217,
1262
+ "serv": 1989,
1263
+ "ervi": 906,
1264
+ "rvic": 1933,
1265
+ "vice": 2291,
1266
+ "ice ": 1113,
1267
+ "pt": 1753,
1268
+ " op": 165,
1269
+ "opt": 1631,
1270
+ "pti": 1754,
1271
+ " opt": 166,
1272
+ "opti": 1632,
1273
+ "ptio": 1755,
1274
+ "ions": 1182,
1275
+ "ons ": 1608,
1276
+ "as ": 547,
1277
+ " as ": 26,
1278
+ "ni": 1462,
1279
+ "mor": 1367,
1280
+ "orn": 1646,
1281
+ "rni": 1880,
1282
+ "nin": 1466,
1283
+ " mor": 146,
1284
+ "morn": 1368,
1285
+ "orni": 1647,
1286
+ "rnin": 1881,
1287
+ "ning": 1467,
1288
+ "ey": 964,
1289
+ "hey": 1080,
1290
+ "ey ": 965,
1291
+ " hey": 97,
1292
+ "hey ": 1081,
1293
+ "by": 616,
1294
+ "ye": 2364,
1295
+ " by": 38,
1296
+ "bye": 617,
1297
+ "ye ": 2365,
1298
+ " bye": 39,
1299
+ "bye ": 618,
1300
+ "odb": 1546,
1301
+ "dby": 717,
1302
+ "oodb": 1619,
1303
+ "odby": 1547,
1304
+ "dbye": 718,
1305
+ "la": 1260,
1306
+ " la": 125,
1307
+ "lat": 1263,
1308
+ " lat": 127,
1309
+ "late": 1264,
1310
+ "ater": 562,
1311
+ "hav": 1063,
1312
+ "ave": 572,
1313
+ "ve ": 2282,
1314
+ " hav": 93,
1315
+ "have": 1064,
1316
+ "ave ": 573,
1317
+ "a ": 432,
1318
+ " a ": 8,
1319
+ " ni": 155,
1320
+ "nic": 1463,
1321
+ " nic": 156,
1322
+ "nice": 1465,
1323
+ " d": 54,
1324
+ "ay": 577,
1325
+ " da": 55,
1326
+ "day": 712,
1327
+ "ay ": 578,
1328
+ " day": 57,
1329
+ "day ": 713,
1330
+ "xi": 2345,
1331
+ " ex": 72,
1332
+ "exi": 960,
1333
+ "xit": 2346,
1334
+ " exi": 73,
1335
+ "exit": 961,
1336
+ "xit ": 2347,
1337
+ " en": 67,
1338
+ " end": 68,
1339
+ "uit": 2217,
1340
+ " qui": 192,
1341
+ "quit": 1771,
1342
+ "uit ": 2218,
1343
+ "nk ": 1475,
1344
+ "ank ": 512,
1345
+ " ye": 276,
1346
+ "yes": 2366,
1347
+ " yes": 277,
1348
+ "yes ": 2367,
1349
+ " s ": 207,
1350
+ "gh": 1019,
1351
+ "ht": 1101,
1352
+ " ri": 202,
1353
+ "rig": 1858,
1354
+ "igh": 1130,
1355
+ "ght": 1020,
1356
+ "ht ": 1102,
1357
+ " rig": 203,
1358
+ "righ": 1859,
1359
+ "ight": 1131,
1360
+ "ght ": 1021,
1361
+ "ds": 761,
1362
+ "sou": 2028,
1363
+ "oun": 1666,
1364
+ "und": 2235,
1365
+ "nds": 1425,
1366
+ "ds ": 762,
1367
+ " sou": 224,
1368
+ "soun": 2029,
1369
+ "ound": 1667,
1370
+ "unds": 2236,
1371
+ "nds ": 1426,
1372
+ "jo": 1232,
1373
+ "oh": 1560,
1374
+ "hn": 1089,
1375
+ ".d": 298,
1376
+ "oe": 1552,
1377
+ "@e": 378,
1378
+ "xa": 2342,
1379
+ "am": 494,
1380
+ "e.": 771,
1381
+ " jo": 119,
1382
+ "joh": 1235,
1383
+ "ohn": 1561,
1384
+ "hn.": 1090,
1385
+ "n.d": 1389,
1386
+ ".do": 301,
1387
+ "doe": 752,
1388
+ "oe@": 1553,
1389
+ "e@e": 785,
1390
+ "@ex": 379,
1391
+ "exa": 958,
1392
+ "xam": 2343,
1393
+ "amp": 498,
1394
+ "mpl": 1373,
1395
+ "le.": 1271,
1396
+ "e.c": 772,
1397
+ " joh": 121,
1398
+ "john": 1236,
1399
+ "ohn.": 1562,
1400
+ "hn.d": 1091,
1401
+ "n.do": 1390,
1402
+ ".doe": 302,
1403
+ "doe@": 753,
1404
+ "oe@e": 1555,
1405
+ "e@ex": 786,
1406
+ "@exa": 380,
1407
+ "exam": 959,
1408
+ "xamp": 2344,
1409
+ "ampl": 499,
1410
+ "mple": 1374,
1411
+ "ple.": 1713,
1412
+ "le.c": 1272,
1413
+ "e.co": 773,
1414
+ "sa": 1958,
1415
+ "ra": 1788,
1416
+ "ah": 470,
1417
+ "h@": 1043,
1418
+ " sa": 208,
1419
+ "sar": 1964,
1420
+ "ara": 527,
1421
+ "rah": 1791,
1422
+ "ah@": 471,
1423
+ "h@c": 1044,
1424
+ " sar": 210,
1425
+ "sara": 1965,
1426
+ "arah": 528,
1427
+ "rah@": 1792,
1428
+ "ah@c": 472,
1429
+ "h@co": 1045,
1430
+ "r1": 1774,
1431
+ "er1": 882,
1432
+ "r12": 1775,
1433
+ "3@d": 356,
1434
+ "n.n": 1391,
1435
+ "ser1": 1987,
1436
+ "er12": 883,
1437
+ "r123": 1776,
1438
+ "23@d": 348,
1439
+ "3@do": 357,
1440
+ "in.n": 1151,
1441
+ "n.ne": 1392,
1442
+ "nta": 1503,
1443
+ "tac": 2087,
1444
+ "act": 450,
1445
+ "cont": 673,
1446
+ "onta": 1613,
1447
+ "ntac": 1504,
1448
+ "tact": 2088,
1449
+ "act ": 451,
1450
+ "o.": 1530,
1451
+ "uk": 2219,
1452
+ "ct@": 683,
1453
+ "s.c": 1947,
1454
+ "co.": 662,
1455
+ "o.u": 1531,
1456
+ ".uk": 329,
1457
+ "uk ": 2220,
1458
+ "act@": 452,
1459
+ "ct@b": 684,
1460
+ "ss.c": 2036,
1461
+ "s.co": 1948,
1462
+ ".co.": 296,
1463
+ "co.u": 663,
1464
+ "o.uk": 1532,
1465
+ ".uk ": 330,
1466
+ "z": 2377,
1467
+ "o@": 1533,
1468
+ "@o": 393,
1469
+ "ga": 1006,
1470
+ "iz": 1222,
1471
+ "za": 2378,
1472
+ "inf": 1163,
1473
+ "nfo": 1446,
1474
+ "fo@": 986,
1475
+ "o@o": 1534,
1476
+ "@or": 394,
1477
+ "rga": 1851,
1478
+ "gan": 1007,
1479
+ "ani": 509,
1480
+ "niz": 1472,
1481
+ "iza": 1223,
1482
+ "zat": 2379,
1483
+ "on.": 1590,
1484
+ "n.c": 1387,
1485
+ " inf": 109,
1486
+ "info": 1164,
1487
+ "nfo@": 1447,
1488
+ "fo@o": 987,
1489
+ "o@or": 1535,
1490
+ "@org": 395,
1491
+ "orga": 1641,
1492
+ "rgan": 1852,
1493
+ "gani": 1008,
1494
+ "aniz": 510,
1495
+ "niza": 1473,
1496
+ "izat": 1224,
1497
+ "zati": 2380,
1498
+ "ion.": 1181,
1499
+ "on.c": 1591,
1500
+ "n.co": 1388,
1501
+ "@h": 387,
1502
+ "sk": 2008,
1503
+ "k.": 1242,
1504
+ "rt@": 1919,
1505
+ "t@h": 2084,
1506
+ "@he": 388,
1507
+ "lpd": 1308,
1508
+ "pde": 1700,
1509
+ "des": 733,
1510
+ "esk": 917,
1511
+ "sk.": 2009,
1512
+ "k.n": 1243,
1513
+ " sup": 230,
1514
+ "ort@": 1654,
1515
+ "rt@h": 1920,
1516
+ "t@he": 2085,
1517
+ "@hel": 389,
1518
+ "elpd": 851,
1519
+ "lpde": 1309,
1520
+ "pdes": 1701,
1521
+ "desk": 734,
1522
+ "esk.": 918,
1523
+ "sk.n": 2010,
1524
+ "k.ne": 1244,
1525
+ "iri": 1191,
1526
+ "rie": 1856,
1527
+ "ies": 1122,
1528
+ "uiri": 2215,
1529
+ "irie": 1192,
1530
+ "ries": 1857,
1531
+ "ies ": 1123,
1532
+ "sm": 2014,
1533
+ "mi": 1353,
1534
+ "@g": 384,
1535
+ "gm": 1025,
1536
+ "ry.": 1936,
1537
+ "y.s": 2359,
1538
+ ".sm": 321,
1539
+ "smi": 2015,
1540
+ "mit": 1357,
1541
+ "ith": 1210,
1542
+ "th@": 2126,
1543
+ "h@g": 1046,
1544
+ "@gm": 385,
1545
+ "gma": 1026,
1546
+ "mary": 1336,
1547
+ "ary.": 544,
1548
+ "ry.s": 1937,
1549
+ "y.sm": 2360,
1550
+ ".smi": 322,
1551
+ "smit": 2016,
1552
+ "mith": 1358,
1553
+ "ith@": 1211,
1554
+ "th@g": 2127,
1555
+ "h@gm": 1047,
1556
+ "@gma": 386,
1557
+ "gmai": 1027,
1558
+ "m@": 1322,
1559
+ "oj": 1566,
1560
+ "v ": 2273,
1561
+ " te": 237,
1562
+ "tea": 2107,
1563
+ "eam": 795,
1564
+ "am@": 496,
1565
+ "m@p": 1323,
1566
+ "@pr": 399,
1567
+ "roj": 1892,
1568
+ "oje": 1567,
1569
+ "ct.": 681,
1570
+ "t.d": 2079,
1571
+ ".de": 299,
1572
+ "dev": 735,
1573
+ "ev ": 942,
1574
+ " tea": 238,
1575
+ "team": 2108,
1576
+ "eam@": 797,
1577
+ "am@p": 497,
1578
+ "m@pr": 1324,
1579
+ "@pro": 400,
1580
+ "proj": 1750,
1581
+ "roje": 1893,
1582
+ "ojec": 1568,
1583
+ "ect.": 813,
1584
+ "ct.d": 682,
1585
+ "t.de": 2080,
1586
+ ".dev": 300,
1587
+ "dev ": 736,
1588
+ "@r": 401,
1589
+ "sal": 1961,
1590
+ "ale": 487,
1591
+ "es@": 913,
1592
+ "s@r": 1954,
1593
+ "@re": 402,
1594
+ "ret": 1843,
1595
+ "eta": 933,
1596
+ "tai": 2091,
1597
+ "l.s": 1258,
1598
+ " sal": 209,
1599
+ "sale": 1963,
1600
+ "ales": 489,
1601
+ "les@": 1282,
1602
+ "es@r": 914,
1603
+ "s@re": 1955,
1604
+ "@ret": 403,
1605
+ "reta": 1844,
1606
+ "etai": 934,
1607
+ "tail": 2092,
1608
+ "il.s": 1138,
1609
+ "l.st": 1259,
1610
+ " me": 139,
1611
+ " me ": 140,
1612
+ "n.s": 1395,
1613
+ "h@p": 1048,
1614
+ "hn.s": 1092,
1615
+ "n.sm": 1396,
1616
+ "th@p": 2128,
1617
+ "h@pe": 1049,
1618
+ "ef": 834,
1619
+ "ref": 1829,
1620
+ "efe": 835,
1621
+ "fer": 974,
1622
+ " pre": 179,
1623
+ "pref": 1742,
1624
+ "refe": 1830,
1625
+ "efer": 836,
1626
+ "fer ": 975,
1627
+ "iv": 1216,
1628
+ "cu": 687,
1629
+ "pri": 1744,
1630
+ "riv": 1864,
1631
+ "iva": 1217,
1632
+ "vat": 2279,
1633
+ "te@": 2105,
1634
+ "e@s": 787,
1635
+ "@se": 405,
1636
+ "ecu": 814,
1637
+ "cur": 688,
1638
+ "re.": 1814,
1639
+ "e.m": 774,
1640
+ ".ma": 307,
1641
+ " pri": 180,
1642
+ "priv": 1745,
1643
+ "riva": 1865,
1644
+ "ivat": 1218,
1645
+ "vate": 2280,
1646
+ "ate@": 559,
1647
+ "te@s": 2106,
1648
+ "e@se": 788,
1649
+ "@sec": 406,
1650
+ "secu": 1981,
1651
+ "ecur": 815,
1652
+ "cure": 689,
1653
+ "ure.": 2253,
1654
+ "re.m": 1815,
1655
+ "e.ma": 775,
1656
+ ".mai": 308,
1657
+ "mm": 1359,
1658
+ "omm": 1581,
1659
+ "mmu": 1360,
1660
+ "mun": 1380,
1661
+ "uni": 2237,
1662
+ " com": 49,
1663
+ "comm": 666,
1664
+ "ommu": 1582,
1665
+ "mmun": 1361,
1666
+ "muni": 1381,
1667
+ "unic": 2238,
1668
+ "nica": 1464,
1669
+ "wo": 2329,
1670
+ "rk": 1866,
1671
+ " wo": 268,
1672
+ "wor": 2330,
1673
+ "ork": 1642,
1674
+ "rk ": 1867,
1675
+ " wor": 269,
1676
+ "work": 2331,
1677
+ "ork ": 1643,
1678
+ "j.": 1226,
1679
+ " j.": 117,
1680
+ "j.d": 1227,
1681
+ " j.d": 118,
1682
+ "j.do": 1228,
1683
+ "oe@c": 1554,
1684
+ "ws": 2335,
1685
+ "sl": 2011,
1686
+ "@u": 412,
1687
+ "ews": 955,
1688
+ "wsl": 2336,
1689
+ "sle": 2012,
1690
+ "let": 1283,
1691
+ "ett": 939,
1692
+ "r@u": 1782,
1693
+ "@up": 415,
1694
+ "tes": 2123,
1695
+ "es.": 911,
1696
+ "news": 1440,
1697
+ "ewsl": 956,
1698
+ "wsle": 2337,
1699
+ "slet": 2013,
1700
+ "lett": 1285,
1701
+ "ette": 940,
1702
+ "ter@": 2119,
1703
+ "er@u": 886,
1704
+ "r@up": 1783,
1705
+ "@upd": 416,
1706
+ "ates": 563,
1707
+ "tes.": 2124,
1708
+ "es.c": 912,
1709
+ "cr": 676,
1710
+ "ip": 1185,
1711
+ "ubs": 2197,
1712
+ "bsc": 605,
1713
+ "scr": 1969,
1714
+ "cri": 677,
1715
+ "rip": 1860,
1716
+ "ipt": 1186,
1717
+ "subs": 2063,
1718
+ "ubsc": 2198,
1719
+ "bscr": 606,
1720
+ "scri": 1970,
1721
+ "crip": 678,
1722
+ "ript": 1861,
1723
+ "ipti": 1187,
1724
+ "nv": 1518,
1725
+ "vo": 2302,
1726
+ "oi": 1563,
1727
+ "inv": 1175,
1728
+ "nvo": 1521,
1729
+ "voi": 2303,
1730
+ "oic": 1564,
1731
+ " inv": 111,
1732
+ "invo": 1177,
1733
+ "nvoi": 1522,
1734
+ "voic": 2304,
1735
+ "oice": 1565,
1736
+ "ices": 1115,
1737
+ "ces ": 641,
1738
+ "bi": 590,
1739
+ "g@": 1003,
1740
+ "@f": 381,
1741
+ " bi": 33,
1742
+ "bil": 591,
1743
+ "ill": 1141,
1744
+ "lli": 1296,
1745
+ "ng@": 1453,
1746
+ "g@f": 1004,
1747
+ "@fi": 382,
1748
+ "fin": 981,
1749
+ "ina": 1155,
1750
+ "nan": 1406,
1751
+ "ce.": 633,
1752
+ "e.o": 778,
1753
+ " bil": 34,
1754
+ "bill": 592,
1755
+ "illi": 1142,
1756
+ "llin": 1297,
1757
+ "ling": 1293,
1758
+ "ing@": 1168,
1759
+ "ng@f": 1454,
1760
+ "g@fi": 1005,
1761
+ "@fin": 383,
1762
+ "fina": 982,
1763
+ "inan": 1156,
1764
+ "nanc": 1407,
1765
+ "nce.": 1413,
1766
+ "ce.o": 635,
1767
+ "e.or": 779,
1768
+ "e.n": 776,
1769
+ "p@se": 1690,
1770
+ "@ser": 407,
1771
+ "ice.": 1114,
1772
+ "ce.n": 634,
1773
+ "e.ne": 777,
1774
+ " ou": 169,
1775
+ " our": 170,
1776
+ "ann": 514,
1777
+ "hann": 1057,
1778
+ "anne": 515,
1779
+ "tm": 2155,
1780
+ " de": 58,
1781
+ "dep": 728,
1782
+ "epa": 870,
1783
+ "par": 1695,
1784
+ "art": 540,
1785
+ "rtm": 1925,
1786
+ "tme": 2156,
1787
+ "men": 1347,
1788
+ "tal": 2095,
1789
+ "al ": 484,
1790
+ " dep": 59,
1791
+ "depa": 729,
1792
+ "epar": 871,
1793
+ "part": 1696,
1794
+ "artm": 542,
1795
+ "rtme": 1926,
1796
+ "tmen": 2157,
1797
+ "ment": 1348,
1798
+ "enta": 867,
1799
+ "ntal": 1505,
1800
+ "tal ": 2096,
1801
+ "ty": 2186,
1802
+ ".e": 303,
1803
+ "ese": 915,
1804
+ "sea": 1977,
1805
+ "ear": 798,
1806
+ "ch@": 645,
1807
+ "h@u": 1050,
1808
+ "@un": 413,
1809
+ "niv": 1470,
1810
+ "ive": 1219,
1811
+ "rsi": 1909,
1812
+ "sit": 2006,
1813
+ "ity": 1214,
1814
+ "ty.": 2187,
1815
+ "y.e": 2355,
1816
+ ".ed": 304,
1817
+ "du ": 764,
1818
+ "rese": 1840,
1819
+ "esea": 916,
1820
+ "sear": 1978,
1821
+ "earc": 799,
1822
+ "rch@": 1803,
1823
+ "ch@u": 646,
1824
+ "h@un": 1051,
1825
+ "@uni": 414,
1826
+ "univ": 2239,
1827
+ "nive": 1471,
1828
+ "iver": 1221,
1829
+ "vers": 2287,
1830
+ "ersi": 900,
1831
+ "rsit": 1910,
1832
+ "sity": 2007,
1833
+ "ity.": 1215,
1834
+ "ty.e": 2188,
1835
+ "y.ed": 2356,
1836
+ ".edu": 305,
1837
+ "edu ": 821,
1838
+ " cu": 52,
1839
+ "cus": 690,
1840
+ "ust": 2266,
1841
+ "tom": 2160,
1842
+ "ome": 1579,
1843
+ "mer": 1349,
1844
+ " cus": 53,
1845
+ "cust": 692,
1846
+ "usto": 2267,
1847
+ "stom": 2056,
1848
+ "tome": 2161,
1849
+ "omer": 1580,
1850
+ "mer ": 1350,
1851
+ "br": 601,
1852
+ "d.": 695,
1853
+ "re@": 1816,
1854
+ "e@b": 781,
1855
+ "@br": 365,
1856
+ "bra": 602,
1857
+ "ran": 1795,
1858
+ "nd.": 1417,
1859
+ "d.c": 696,
1860
+ "are@": 536,
1861
+ "re@b": 1817,
1862
+ "e@br": 782,
1863
+ "@bra": 366,
1864
+ "bran": 603,
1865
+ "rand": 1796,
1866
+ "and.": 505,
1867
+ "nd.c": 1418,
1868
+ "d.co": 697,
1869
+ "ach ": 446,
1870
+ "him": 1084,
1871
+ "im ": 1144,
1872
+ " him": 100,
1873
+ "him ": 1085,
1874
+ "id": 1116,
1875
+ "d@": 700,
1876
+ "g.": 1000,
1877
+ ".p": 317,
1878
+ "dav": 710,
1879
+ "vid": 2292,
1880
+ "id@": 1117,
1881
+ "d@c": 701,
1882
+ "nsu": 1497,
1883
+ "sul": 2064,
1884
+ "ult": 2227,
1885
+ "lti": 1313,
1886
+ "ng.": 1451,
1887
+ "g.p": 1001,
1888
+ ".pr": 318,
1889
+ "ro ": 1887,
1890
+ " dav": 56,
1891
+ "davi": 711,
1892
+ "avid": 575,
1893
+ "vid@": 2293,
1894
+ "id@c": 1118,
1895
+ "d@co": 702,
1896
+ "@con": 372,
1897
+ "cons": 672,
1898
+ "onsu": 1611,
1899
+ "nsul": 1498,
1900
+ "sult": 2065,
1901
+ "ulti": 2228,
1902
+ "ltin": 1314,
1903
+ "ing.": 1167,
1904
+ "ng.p": 1452,
1905
+ "g.pr": 1002,
1906
+ ".pro": 319,
1907
+ "pro ": 1747,
1908
+ "ot": 1659,
1909
+ "if": 1126,
1910
+ "@a": 361,
1911
+ "ts": 2174,
1912
+ ".a": 288,
1913
+ "p ": 1684,
1914
+ "not": 1484,
1915
+ "oti": 1660,
1916
+ "tif": 2138,
1917
+ "ifi": 1127,
1918
+ "fic": 977,
1919
+ "ns@": 1493,
1920
+ "s@a": 1952,
1921
+ "@al": 362,
1922
+ "ler": 1278,
1923
+ "ert": 902,
1924
+ "rts": 1927,
1925
+ "ts.": 2175,
1926
+ "s.a": 1945,
1927
+ ".ap": 289,
1928
+ "pp ": 1731,
1929
+ " not": 158,
1930
+ "noti": 1485,
1931
+ "otif": 1661,
1932
+ "tifi": 2139,
1933
+ "ific": 1128,
1934
+ "fica": 978,
1935
+ "ons@": 1609,
1936
+ "ns@a": 1494,
1937
+ "s@al": 1953,
1938
+ "@ale": 363,
1939
+ "aler": 488,
1940
+ "lert": 1279,
1941
+ "erts": 903,
1942
+ "rts.": 1928,
1943
+ "ts.a": 2176,
1944
+ "s.ap": 1946,
1945
+ ".app": 290,
1946
+ "app ": 523,
1947
+ "ends": 861,
1948
+ "sy": 2071,
1949
+ "ys": 2374,
1950
+ " sy": 231,
1951
+ "sys": 2072,
1952
+ "yst": 2375,
1953
+ "ste": 2050,
1954
+ "tem": 2115,
1955
+ "em ": 855,
1956
+ " sys": 232,
1957
+ "syst": 2073,
1958
+ "yste": 2376,
1959
+ "stem": 2052,
1960
+ "tem ": 2116,
1961
+ "ag": 466,
1962
+ "mes": 1351,
1963
+ "ssa": 2038,
1964
+ "sag": 1959,
1965
+ "age": 468,
1966
+ " mes": 142,
1967
+ "mess": 1352,
1968
+ "essa": 924,
1969
+ "ssag": 2039,
1970
+ "sage": 1960,
1971
+ "ages": 469,
1972
+ "dm": 748,
1973
+ "n@": 1400,
1974
+ "hb": 1065,
1975
+ "oa": 1536,
1976
+ ".t": 325,
1977
+ "adm": 461,
1978
+ "dmi": 749,
1979
+ "min": 1354,
1980
+ "in@": 1153,
1981
+ "n@d": 1401,
1982
+ "@da": 374,
1983
+ "das": 706,
1984
+ "ash": 551,
1985
+ "shb": 1994,
1986
+ "hbo": 1066,
1987
+ "boa": 597,
1988
+ "oar": 1537,
1989
+ "ard": 531,
1990
+ "rd.": 1806,
1991
+ "d.t": 698,
1992
+ ".to": 326,
1993
+ "too": 2163,
1994
+ "ool": 1620,
1995
+ "ol ": 1570,
1996
+ " adm": 17,
1997
+ "admi": 462,
1998
+ "dmin": 750,
1999
+ "min@": 1355,
2000
+ "in@d": 1154,
2001
+ "n@da": 1402,
2002
+ "@das": 375,
2003
+ "dash": 707,
2004
+ "ashb": 552,
2005
+ "shbo": 1995,
2006
+ "hboa": 1067,
2007
+ "boar": 598,
2008
+ "oard": 1538,
2009
+ "ard.": 532,
2010
+ "rd.t": 1807,
2011
+ "d.to": 699,
2012
+ ".too": 327,
2013
+ "tool": 2164,
2014
+ "ool ": 1621,
2015
+ " has": 92,
2016
+ "has ": 1059,
2017
+ "tr": 2169,
2018
+ "ini": 1170,
2019
+ "nis": 1468,
2020
+ "str": 2058,
2021
+ "tra": 2170,
2022
+ "rat": 1797,
2023
+ "tiv": 2150,
2024
+ "mini": 1356,
2025
+ "inis": 1172,
2026
+ "nist": 1469,
2027
+ "istr": 1203,
2028
+ "stra": 2059,
2029
+ "trat": 2173,
2030
+ "rati": 1799,
2031
+ "ativ": 566,
2032
+ "tive": 2151,
2033
+ "ive ": 1220,
2034
+ "cc": 628,
2035
+ " ac": 12,
2036
+ "acc": 443,
2037
+ "cce": 629,
2038
+ " acc": 13,
2039
+ "acce": 444,
2040
+ "cces": 630,
2041
+ "mee": 1345,
2042
+ " mee": 141,
2043
+ "meet": 1346,
2044
+ "omo": 1583,
2045
+ "rro": 1903,
2046
+ "row": 1898,
2047
+ " tom": 247,
2048
+ "tomo": 2162,
2049
+ "omor": 1584,
2050
+ "morr": 1369,
2051
+ "orro": 1650,
2052
+ "rrow": 1904,
2053
+ "row ": 1899,
2054
+ " 2": 1,
2055
+ "2p": 350,
2056
+ "pm": 1720,
2057
+ " 2p": 2,
2058
+ "2pm": 351,
2059
+ "pm ": 1721,
2060
+ " 2pm": 3,
2061
+ "2pm ": 352,
2062
+ "ua": 2191,
2063
+ "rl": 1870,
2064
+ "qua": 1765,
2065
+ "uar": 2192,
2066
+ "rte": 1923,
2067
+ "erl": 891,
2068
+ "rly": 1871,
2069
+ " qua": 190,
2070
+ "quar": 1766,
2071
+ "uart": 2193,
2072
+ "arte": 541,
2073
+ "rter": 1924,
2074
+ "terl": 2120,
2075
+ "erly": 892,
2076
+ "rly ": 1872,
2077
+ "q2": 1760,
2078
+ "2 ": 342,
2079
+ " q2": 185,
2080
+ "q2 ": 1761,
2081
+ " q2 ": 186,
2082
+ "ph": 1705,
2083
+ " ph": 174,
2084
+ "pha": 1706,
2085
+ " pha": 175,
2086
+ "phas": 1707,
2087
+ "ete": 935,
2088
+ "ompl": 1587,
2089
+ "plet": 1715,
2090
+ "lete": 1284,
2091
+ "ete ": 936,
2092
+ " ur": 256,
2093
+ "urg": 2255,
2094
+ "rge": 1853,
2095
+ "gen": 1013,
2096
+ " urg": 257,
2097
+ "urge": 2256,
2098
+ "rgen": 1854,
2099
+ "gent": 1014,
2100
+ "rve": 1930,
2101
+ "erve": 905,
2102
+ "rver": 1931,
2103
+ "ver ": 2286,
2104
+ "wn": 2326,
2105
+ " do": 62,
2106
+ "dow": 756,
2107
+ "own": 1681,
2108
+ "wnt": 2327,
2109
+ " dow": 63,
2110
+ "down": 757,
2111
+ "ownt": 1682,
2112
+ "wnti": 2328,
2113
+ "ntim": 1512,
2114
+ "nvi": 1519,
2115
+ "vit": 2300,
2116
+ "ita": 1206,
2117
+ "invi": 1176,
2118
+ "nvit": 1520,
2119
+ "vita": 2301,
2120
+ "itat": 1207,
2121
+ "tati": 2101,
2122
+ "ven": 2283,
2123
+ "even": 944,
2124
+ "vent": 2284,
2125
+ "w-": 2309,
2126
+ "-u": 284,
2127
+ "fol": 988,
2128
+ "oll": 1571,
2129
+ "low": 1303,
2130
+ "ow-": 1679,
2131
+ "w-u": 2310,
2132
+ "-up": 285,
2133
+ "up ": 2243,
2134
+ " fol": 82,
2135
+ "foll": 989,
2136
+ "ollo": 1572,
2137
+ "llow": 1300,
2138
+ "low-": 1304,
2139
+ "ow-u": 1680,
2140
+ "w-up": 2311,
2141
+ "-up ": 286,
2142
+ "rev": 1845,
2143
+ "evi": 946,
2144
+ "vio": 2298,
2145
+ "iou": 1183,
2146
+ "ous": 1670,
2147
+ "prev": 1743,
2148
+ "revi": 1846,
2149
+ "evio": 948,
2150
+ "viou": 2299,
2151
+ "ious": 1184,
2152
+ "ous ": 1671,
2153
+ "di": 740,
2154
+ " di": 60,
2155
+ "dis": 743,
2156
+ "isc": 1199,
2157
+ "scu": 1971,
2158
+ "uss": 2264,
2159
+ "sio": 2002,
2160
+ " dis": 61,
2161
+ "disc": 744,
2162
+ "iscu": 1200,
2163
+ "scus": 1972,
2164
+ "cuss": 691,
2165
+ "ussi": 2265,
2166
+ "ssio": 2043,
2167
+ "sion": 2003,
2168
+ "ob": 1539,
2169
+ "b ": 580,
2170
+ "job": 1233,
2171
+ "ob ": 1540,
2172
+ " job": 120,
2173
+ "job ": 1234,
2174
+ "tw": 2183,
2175
+ "wa": 2315,
2176
+ "sof": 2018,
2177
+ "oft": 1558,
2178
+ "ftw": 996,
2179
+ "twa": 2184,
2180
+ "war": 2316,
2181
+ " sof": 222,
2182
+ "soft": 2019,
2183
+ "oftw": 1559,
2184
+ "ftwa": 997,
2185
+ "twar": 2185,
2186
+ "ware": 2317,
2187
+ "gi": 1022,
2188
+ "eng": 862,
2189
+ "ngi": 1458,
2190
+ "gin": 1023,
2191
+ "nee": 1429,
2192
+ "eer": 830,
2193
+ " eng": 69,
2194
+ "engi": 863,
2195
+ "ngin": 1459,
2196
+ "gine": 1024,
2197
+ "inee": 1161,
2198
+ "neer": 1430,
2199
+ "eer ": 831,
2200
+ "eg": 837,
2201
+ "reg": 1831,
2202
+ "ega": 838,
2203
+ "gar": 1009,
2204
+ "rdi": 1810,
2205
+ "din": 741,
2206
+ " reg": 197,
2207
+ "rega": 1832,
2208
+ "egar": 839,
2209
+ "gard": 1010,
2210
+ "ardi": 533,
2211
+ "rdin": 1811,
2212
+ "ding": 742,
2213
+ "ew ": 950,
2214
+ "new ": 1438,
2215
+ " im": 105,
2216
+ "imp": 1147,
2217
+ "mpo": 1375,
2218
+ "rta": 1921,
2219
+ "ant": 516,
2220
+ " imp": 106,
2221
+ "impo": 1148,
2222
+ "mpor": 1376,
2223
+ "orta": 1655,
2224
+ "rtan": 1922,
2225
+ "tant": 2099,
2226
+ "ant ": 517,
2227
+ "cti": 685,
2228
+ " act": 14,
2229
+ "acti": 453,
2230
+ "ctio": 686,
2231
+ "ire": 1189,
2232
+ "red": 1825,
2233
+ "equi": 879,
2234
+ "uire": 2214,
2235
+ "ired": 1190,
2236
+ "red ": 1826,
2237
+ "orm": 1644,
2238
+ "rma": 1875,
2239
+ "nfor": 1448,
2240
+ "form": 992,
2241
+ "orma": 1645,
2242
+ "rmat": 1876,
2243
+ "mati": 1338,
2244
+ "hea": 1070,
2245
+ "ead": 792,
2246
+ "ade": 459,
2247
+ " hea": 95,
2248
+ "head": 1071,
2249
+ "eade": 794,
2250
+ "ader": 460,
2251
+ "der ": 731,
2252
+ "pi": 1708,
2253
+ "c ": 620,
2254
+ "top": 2165,
2255
+ "opi": 1627,
2256
+ "pic": 1709,
2257
+ "ic ": 1109,
2258
+ " top": 248,
2259
+ "topi": 2166,
2260
+ "opic": 1628,
2261
+ "pic ": 1710,
2262
+ "am ": 495,
2263
+ "eam ": 796,
2264
+ " bu": 35,
2265
+ "bui": 612,
2266
+ "uil": 2211,
2267
+ "ild": 1139,
2268
+ "ldi": 1267,
2269
+ " bui": 37,
2270
+ "buil": 613,
2271
+ "uild": 2212,
2272
+ "ildi": 1140,
2273
+ "ldin": 1268,
2274
+ "sv": 2068,
2275
+ "vp": 2305,
2276
+ " rs": 204,
2277
+ "rsv": 1913,
2278
+ "svp": 2069,
2279
+ "vp ": 2306,
2280
+ " rsv": 205,
2281
+ "rsvp": 1914,
2282
+ "svp ": 2070,
2283
+ "lab": 1261,
2284
+ "abe": 434,
2285
+ "bel": 586,
2286
+ " lab": 126,
2287
+ "labe": 1262,
2288
+ "abel": 435,
2289
+ "bel ": 587,
2290
+ "ud": 2204,
2291
+ "dg": 737,
2292
+ "bud": 610,
2293
+ "udg": 2205,
2294
+ "dge": 738,
2295
+ "get": 1017,
2296
+ " bud": 36,
2297
+ "budg": 611,
2298
+ "udge": 2206,
2299
+ "dget": 739,
2300
+ "get ": 1018,
2301
+ "ov": 1674,
2302
+ "rov": 1896,
2303
+ "ova": 1675,
2304
+ "val": 2277,
2305
+ "ppro": 1738,
2306
+ "prov": 1752,
2307
+ "rova": 1897,
2308
+ "oval": 1676,
2309
+ "val ": 2278,
2310
+ "q3": 1762,
2311
+ "3 ": 354,
2312
+ " q3": 187,
2313
+ "q3 ": 1763,
2314
+ " q3 ": 188,
2315
+ "tag": 2089,
2316
+ "ag ": 467,
2317
+ " tag": 235,
2318
+ "tag ": 2090,
2319
+ "os": 1656,
2320
+ "rop": 1894,
2321
+ "opo": 1629,
2322
+ "pos": 1728,
2323
+ "osa": 1657,
2324
+ "prop": 1751,
2325
+ "ropo": 1895,
2326
+ "opos": 1630,
2327
+ "posa": 1729,
2328
+ "osal": 1658,
2329
+ "sal ": 1962,
2330
+ "ark": 538,
2331
+ "rke": 1868,
2332
+ "ket": 1247,
2333
+ "mark": 1335,
2334
+ "arke": 539,
2335
+ "rket": 1869,
2336
+ "keti": 1248,
2337
+ "gy": 1039,
2338
+ "teg": 2111,
2339
+ "egy": 840,
2340
+ "gy ": 1040,
2341
+ " str": 227,
2342
+ "rate": 1798,
2343
+ "ateg": 561,
2344
+ "tegy": 2112,
2345
+ "egy ": 841,
2346
+ " tr": 249,
2347
+ "rai": 1793,
2348
+ " tra": 250,
2349
+ "trai": 2172,
2350
+ "rain": 1794,
2351
+ "aini": 479,
2352
+ "inin": 1171,
2353
+ " ses": 218,
2354
+ "sess": 1992,
2355
+ "essi": 926,
2356
+ "led": 1276,
2357
+ "uled": 2226,
2358
+ "led ": 1277,
2359
+ "ek": 842,
2360
+ " we": 264,
2361
+ "wee": 2319,
2362
+ "eek": 828,
2363
+ "ek ": 843,
2364
+ " wee": 265,
2365
+ "week": 2320,
2366
+ "eek ": 829,
2367
+ "ad ": 455,
2368
+ "read": 1820,
2369
+ "ead ": 793,
2370
+ "ntr": 1513,
2371
+ "rac": 1789,
2372
+ "ontr": 1615,
2373
+ "ntra": 1514,
2374
+ "trac": 2171,
2375
+ "ract": 1790,
2376
+ "vie": 2294,
2377
+ "iew": 1124,
2378
+ " rev": 201,
2379
+ "evie": 947,
2380
+ "view": 2295,
2381
+ "iew ": 1125,
2382
+ "este": 929,
2383
+ "sted": 2051
2384
+ },
2385
+ "response": null,
2386
+ "action_text": null
2387
+ }
components/train_DIETClassifier4/DIETClassifier.data_example.st ADDED
Binary file (2.44 kB). View file
 
components/train_DIETClassifier4/DIETClassifier.data_example_metadata.json ADDED
@@ -0,0 +1,179 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "key": "entities",
4
+ "components": [
5
+ {
6
+ "key": "entity",
7
+ "number_of_dimensions": 3,
8
+ "features": [
9
+ {
10
+ "type": "group",
11
+ "subcomponents": [
12
+ {
13
+ "type": "dense",
14
+ "key": "component_entities_entity_0_0_array",
15
+ "shape": [
16
+ 1,
17
+ 1
18
+ ]
19
+ }
20
+ ]
21
+ }
22
+ ]
23
+ },
24
+ {
25
+ "key": "mask",
26
+ "number_of_dimensions": 3,
27
+ "features": [
28
+ {
29
+ "type": "group",
30
+ "subcomponents": [
31
+ {
32
+ "type": "group",
33
+ "subcomponents": [
34
+ {
35
+ "type": "list",
36
+ "key": "component_entities_mask_0_0_0_list"
37
+ }
38
+ ]
39
+ }
40
+ ]
41
+ }
42
+ ]
43
+ }
44
+ ]
45
+ },
46
+ {
47
+ "key": "label",
48
+ "components": [
49
+ {
50
+ "key": "ids",
51
+ "number_of_dimensions": 2,
52
+ "features": [
53
+ {
54
+ "type": "group",
55
+ "subcomponents": [
56
+ {
57
+ "type": "list",
58
+ "key": "component_label_ids_0_0_list"
59
+ }
60
+ ]
61
+ }
62
+ ]
63
+ },
64
+ {
65
+ "key": "mask",
66
+ "number_of_dimensions": 3,
67
+ "features": [
68
+ {
69
+ "type": "group",
70
+ "subcomponents": [
71
+ {
72
+ "type": "group",
73
+ "subcomponents": [
74
+ {
75
+ "type": "list",
76
+ "key": "component_label_mask_0_0_0_list"
77
+ }
78
+ ]
79
+ }
80
+ ]
81
+ }
82
+ ]
83
+ },
84
+ {
85
+ "key": "sentence",
86
+ "number_of_dimensions": 3,
87
+ "features": [
88
+ {
89
+ "type": "group",
90
+ "subcomponents": [
91
+ {
92
+ "type": "group",
93
+ "subcomponents": [
94
+ {
95
+ "type": "list",
96
+ "key": "component_label_sentence_0_0_0_list"
97
+ }
98
+ ]
99
+ }
100
+ ]
101
+ }
102
+ ]
103
+ }
104
+ ]
105
+ },
106
+ {
107
+ "key": "text",
108
+ "components": [
109
+ {
110
+ "key": "mask",
111
+ "number_of_dimensions": 3,
112
+ "features": [
113
+ {
114
+ "type": "group",
115
+ "subcomponents": [
116
+ {
117
+ "type": "group",
118
+ "subcomponents": [
119
+ {
120
+ "type": "list",
121
+ "key": "component_text_mask_0_0_0_list"
122
+ }
123
+ ]
124
+ }
125
+ ]
126
+ }
127
+ ]
128
+ },
129
+ {
130
+ "key": "sentence",
131
+ "number_of_dimensions": 3,
132
+ "features": [
133
+ {
134
+ "type": "group",
135
+ "subcomponents": [
136
+ {
137
+ "type": "sparse",
138
+ "key": "component_text_sentence_0_0",
139
+ "shape": [
140
+ 1,
141
+ 2381
142
+ ]
143
+ }
144
+ ]
145
+ }
146
+ ]
147
+ },
148
+ {
149
+ "key": "sequence",
150
+ "number_of_dimensions": 3,
151
+ "features": [
152
+ {
153
+ "type": "group",
154
+ "subcomponents": [
155
+ {
156
+ "type": "sparse",
157
+ "key": "component_text_sequence_0_0",
158
+ "shape": [
159
+ 1,
160
+ 2405
161
+ ]
162
+ }
163
+ ]
164
+ }
165
+ ]
166
+ },
167
+ {
168
+ "key": "sequence_lengths",
169
+ "number_of_dimensions": 1,
170
+ "features": [
171
+ {
172
+ "type": "list",
173
+ "key": "component_text_sequence_lengths_0_list"
174
+ }
175
+ ]
176
+ }
177
+ ]
178
+ }
179
+ ]
components/train_DIETClassifier4/DIETClassifier.entity_tag_specs.json ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "tag_name": "entity",
4
+ "ids_to_tags": {
5
+ "1": "B-email",
6
+ "2": "I-email",
7
+ "3": "L-email",
8
+ "4": "U-email",
9
+ "5": "B-subject",
10
+ "6": "I-subject",
11
+ "7": "L-subject",
12
+ "8": "U-subject",
13
+ "0": "O"
14
+ },
15
+ "tags_to_ids": {
16
+ "B-email": 1,
17
+ "I-email": 2,
18
+ "L-email": 3,
19
+ "U-email": 4,
20
+ "B-subject": 5,
21
+ "I-subject": 6,
22
+ "L-subject": 7,
23
+ "U-subject": 8,
24
+ "O": 0
25
+ },
26
+ "num_tags": 9
27
+ }
28
+ ]
components/train_DIETClassifier4/DIETClassifier.index_label_id_mapping.json ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "0": "confirm",
3
+ "1": "goodbye",
4
+ "2": "greeting",
5
+ "3": "provide_email",
6
+ "4": "provide_subject",
7
+ "5": "thank_you"
8
+ }
components/train_DIETClassifier4/DIETClassifier.label_data.st ADDED
Binary file (1.23 kB). View file
 
components/train_DIETClassifier4/DIETClassifier.label_data_metadata.json ADDED
@@ -0,0 +1,107 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "key": "label",
4
+ "components": [
5
+ {
6
+ "key": "sentence",
7
+ "number_of_dimensions": 3,
8
+ "features": [
9
+ {
10
+ "type": "group",
11
+ "subcomponents": [
12
+ {
13
+ "type": "group",
14
+ "subcomponents": [
15
+ {
16
+ "type": "list",
17
+ "key": "component_label_sentence_0_0_0_list"
18
+ }
19
+ ]
20
+ },
21
+ {
22
+ "type": "group",
23
+ "subcomponents": [
24
+ {
25
+ "type": "list",
26
+ "key": "component_label_sentence_0_1_0_list"
27
+ }
28
+ ]
29
+ },
30
+ {
31
+ "type": "group",
32
+ "subcomponents": [
33
+ {
34
+ "type": "list",
35
+ "key": "component_label_sentence_0_2_0_list"
36
+ }
37
+ ]
38
+ },
39
+ {
40
+ "type": "group",
41
+ "subcomponents": [
42
+ {
43
+ "type": "list",
44
+ "key": "component_label_sentence_0_3_0_list"
45
+ }
46
+ ]
47
+ },
48
+ {
49
+ "type": "group",
50
+ "subcomponents": [
51
+ {
52
+ "type": "list",
53
+ "key": "component_label_sentence_0_4_0_list"
54
+ }
55
+ ]
56
+ },
57
+ {
58
+ "type": "group",
59
+ "subcomponents": [
60
+ {
61
+ "type": "list",
62
+ "key": "component_label_sentence_0_5_0_list"
63
+ }
64
+ ]
65
+ }
66
+ ]
67
+ }
68
+ ]
69
+ },
70
+ {
71
+ "key": "ids",
72
+ "number_of_dimensions": 2,
73
+ "features": [
74
+ {
75
+ "type": "group",
76
+ "subcomponents": [
77
+ {
78
+ "type": "list",
79
+ "key": "component_label_ids_0_0_list"
80
+ },
81
+ {
82
+ "type": "list",
83
+ "key": "component_label_ids_0_1_list"
84
+ },
85
+ {
86
+ "type": "list",
87
+ "key": "component_label_ids_0_2_list"
88
+ },
89
+ {
90
+ "type": "list",
91
+ "key": "component_label_ids_0_3_list"
92
+ },
93
+ {
94
+ "type": "list",
95
+ "key": "component_label_ids_0_4_list"
96
+ },
97
+ {
98
+ "type": "list",
99
+ "key": "component_label_ids_0_5_list"
100
+ }
101
+ ]
102
+ }
103
+ ]
104
+ }
105
+ ]
106
+ }
107
+ ]
components/train_DIETClassifier4/DIETClassifier.sparse_feature_sizes.json ADDED
@@ -0,0 +1,11 @@
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "text": {
3
+ "sequence": [
4
+ 24,
5
+ 2381
6
+ ],
7
+ "sentence": [
8
+ 2381
9
+ ]
10
+ }
11
+ }
components/train_DIETClassifier4/DIETClassifier.tf_model.data-00000-of-00001 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:681f3f21d4ff9a579deac2433b591bd011716b3c105582b50044426374c16e34
3
+ size 33243823
components/train_DIETClassifier4/DIETClassifier.tf_model.index ADDED
Binary file (10.5 kB). View file
 
components/train_DIETClassifier4/checkpoint ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ model_checkpoint_path: "DIETClassifier.tf_model"
2
+ all_model_checkpoint_paths: "DIETClassifier.tf_model"
components/train_LexicalSyntacticFeaturizer2/feature_to_idx_dict.json ADDED
@@ -0,0 +1,50 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "0###low": {
3
+ "False": 0,
4
+ "True": 1
5
+ },
6
+ "0###title": {
7
+ "False": 2,
8
+ "True": 3
9
+ },
10
+ "0###upper": {
11
+ "False": 4,
12
+ "True": 5
13
+ },
14
+ "1###BOS": {
15
+ "False": 6,
16
+ "True": 7
17
+ },
18
+ "1###EOS": {
19
+ "False": 8,
20
+ "True": 9
21
+ },
22
+ "1###digit": {
23
+ "False": 10,
24
+ "True": 11
25
+ },
26
+ "1###low": {
27
+ "False": 12,
28
+ "True": 13
29
+ },
30
+ "1###title": {
31
+ "False": 14,
32
+ "True": 15
33
+ },
34
+ "1###upper": {
35
+ "False": 16,
36
+ "True": 17
37
+ },
38
+ "2###low": {
39
+ "False": 18,
40
+ "True": 19
41
+ },
42
+ "2###title": {
43
+ "False": 20,
44
+ "True": 21
45
+ },
46
+ "2###upper": {
47
+ "False": 22,
48
+ "True": 23
49
+ }
50
+ }
components/train_RegexFeaturizer1/patterns.json ADDED
@@ -0,0 +1 @@
 
 
1
+ []
metadata.json ADDED
@@ -0,0 +1,308 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "domain": {
3
+ "session_config": {
4
+ "session_expiration_time": 60,
5
+ "carry_over_slots_to_new_session": true
6
+ },
7
+ "version": "3.1"
8
+ },
9
+ "trained_at": "2025-04-24T04:19:29.585789",
10
+ "model_id": "7415b87aefe24a37a96471238c970e47",
11
+ "assistant_id": "20250424-041832-kinetic-diamond",
12
+ "rasa_open_source_version": "3.6.21",
13
+ "train_schema": {
14
+ "nodes": {
15
+ "schema_validator": {
16
+ "needs": {
17
+ "importer": "__importer__"
18
+ },
19
+ "uses": "rasa.graph_components.validators.default_recipe_validator.DefaultV1RecipeValidator",
20
+ "constructor_name": "create",
21
+ "fn": "validate",
22
+ "config": {},
23
+ "eager": false,
24
+ "is_target": false,
25
+ "is_input": true,
26
+ "resource": null
27
+ },
28
+ "finetuning_validator": {
29
+ "needs": {
30
+ "importer": "schema_validator"
31
+ },
32
+ "uses": "rasa.graph_components.validators.finetuning_validator.FinetuningValidator",
33
+ "constructor_name": "create",
34
+ "fn": "validate",
35
+ "config": {
36
+ "validate_core": false,
37
+ "validate_nlu": true
38
+ },
39
+ "eager": false,
40
+ "is_target": false,
41
+ "is_input": true,
42
+ "resource": null
43
+ },
44
+ "nlu_training_data_provider": {
45
+ "needs": {
46
+ "importer": "finetuning_validator"
47
+ },
48
+ "uses": "rasa.graph_components.providers.nlu_training_data_provider.NLUTrainingDataProvider",
49
+ "constructor_name": "create",
50
+ "fn": "provide",
51
+ "config": {
52
+ "language": "en",
53
+ "persist": false
54
+ },
55
+ "eager": false,
56
+ "is_target": false,
57
+ "is_input": true,
58
+ "resource": null
59
+ },
60
+ "run_WhitespaceTokenizer0": {
61
+ "needs": {
62
+ "training_data": "nlu_training_data_provider"
63
+ },
64
+ "uses": "rasa.nlu.tokenizers.whitespace_tokenizer.WhitespaceTokenizer",
65
+ "constructor_name": "load",
66
+ "fn": "process_training_data",
67
+ "config": {},
68
+ "eager": false,
69
+ "is_target": false,
70
+ "is_input": false,
71
+ "resource": null
72
+ },
73
+ "train_RegexFeaturizer1": {
74
+ "needs": {
75
+ "training_data": "run_WhitespaceTokenizer0"
76
+ },
77
+ "uses": "rasa.nlu.featurizers.sparse_featurizer.regex_featurizer.RegexFeaturizer",
78
+ "constructor_name": "create",
79
+ "fn": "train",
80
+ "config": {},
81
+ "eager": false,
82
+ "is_target": true,
83
+ "is_input": false,
84
+ "resource": null
85
+ },
86
+ "run_RegexFeaturizer1": {
87
+ "needs": {
88
+ "training_data": "run_WhitespaceTokenizer0",
89
+ "resource": "train_RegexFeaturizer1"
90
+ },
91
+ "uses": "rasa.nlu.featurizers.sparse_featurizer.regex_featurizer.RegexFeaturizer",
92
+ "constructor_name": "load",
93
+ "fn": "process_training_data",
94
+ "config": {},
95
+ "eager": false,
96
+ "is_target": false,
97
+ "is_input": false,
98
+ "resource": null
99
+ },
100
+ "train_LexicalSyntacticFeaturizer2": {
101
+ "needs": {
102
+ "training_data": "run_RegexFeaturizer1"
103
+ },
104
+ "uses": "rasa.nlu.featurizers.sparse_featurizer.lexical_syntactic_featurizer.LexicalSyntacticFeaturizer",
105
+ "constructor_name": "create",
106
+ "fn": "train",
107
+ "config": {},
108
+ "eager": false,
109
+ "is_target": true,
110
+ "is_input": false,
111
+ "resource": null
112
+ },
113
+ "run_LexicalSyntacticFeaturizer2": {
114
+ "needs": {
115
+ "training_data": "run_RegexFeaturizer1",
116
+ "resource": "train_LexicalSyntacticFeaturizer2"
117
+ },
118
+ "uses": "rasa.nlu.featurizers.sparse_featurizer.lexical_syntactic_featurizer.LexicalSyntacticFeaturizer",
119
+ "constructor_name": "load",
120
+ "fn": "process_training_data",
121
+ "config": {},
122
+ "eager": false,
123
+ "is_target": false,
124
+ "is_input": false,
125
+ "resource": null
126
+ },
127
+ "train_CountVectorsFeaturizer3": {
128
+ "needs": {
129
+ "training_data": "run_LexicalSyntacticFeaturizer2"
130
+ },
131
+ "uses": "rasa.nlu.featurizers.sparse_featurizer.count_vectors_featurizer.CountVectorsFeaturizer",
132
+ "constructor_name": "create",
133
+ "fn": "train",
134
+ "config": {
135
+ "analyzer": "char_wb",
136
+ "max_ngram": 4,
137
+ "min_ngram": 1
138
+ },
139
+ "eager": false,
140
+ "is_target": true,
141
+ "is_input": false,
142
+ "resource": null
143
+ },
144
+ "run_CountVectorsFeaturizer3": {
145
+ "needs": {
146
+ "training_data": "run_LexicalSyntacticFeaturizer2",
147
+ "resource": "train_CountVectorsFeaturizer3"
148
+ },
149
+ "uses": "rasa.nlu.featurizers.sparse_featurizer.count_vectors_featurizer.CountVectorsFeaturizer",
150
+ "constructor_name": "load",
151
+ "fn": "process_training_data",
152
+ "config": {
153
+ "analyzer": "char_wb",
154
+ "max_ngram": 4,
155
+ "min_ngram": 1
156
+ },
157
+ "eager": false,
158
+ "is_target": false,
159
+ "is_input": false,
160
+ "resource": null
161
+ },
162
+ "train_DIETClassifier4": {
163
+ "needs": {
164
+ "training_data": "run_CountVectorsFeaturizer3"
165
+ },
166
+ "uses": "rasa.nlu.classifiers.diet_classifier.DIETClassifier",
167
+ "constructor_name": "create",
168
+ "fn": "train",
169
+ "config": {
170
+ "constrain_similarities": true,
171
+ "entity_recognition": true,
172
+ "entity_recognition_confidence_threshold": 0.6,
173
+ "epochs": 150,
174
+ "intent_classification_confidence_threshold": 0.7,
175
+ "model_confidence": "softmax"
176
+ },
177
+ "eager": false,
178
+ "is_target": true,
179
+ "is_input": false,
180
+ "resource": null
181
+ }
182
+ }
183
+ },
184
+ "predict_schema": {
185
+ "nodes": {
186
+ "nlu_message_converter": {
187
+ "needs": {
188
+ "messages": "__message__"
189
+ },
190
+ "uses": "rasa.graph_components.converters.nlu_message_converter.NLUMessageConverter",
191
+ "constructor_name": "load",
192
+ "fn": "convert_user_message",
193
+ "config": {},
194
+ "eager": true,
195
+ "is_target": false,
196
+ "is_input": false,
197
+ "resource": null
198
+ },
199
+ "run_WhitespaceTokenizer0": {
200
+ "needs": {
201
+ "messages": "nlu_message_converter"
202
+ },
203
+ "uses": "rasa.nlu.tokenizers.whitespace_tokenizer.WhitespaceTokenizer",
204
+ "constructor_name": "load",
205
+ "fn": "process",
206
+ "config": {},
207
+ "eager": true,
208
+ "is_target": false,
209
+ "is_input": false,
210
+ "resource": null
211
+ },
212
+ "run_RegexFeaturizer1": {
213
+ "needs": {
214
+ "messages": "run_WhitespaceTokenizer0"
215
+ },
216
+ "uses": "rasa.nlu.featurizers.sparse_featurizer.regex_featurizer.RegexFeaturizer",
217
+ "constructor_name": "load",
218
+ "fn": "process",
219
+ "config": {},
220
+ "eager": true,
221
+ "is_target": false,
222
+ "is_input": false,
223
+ "resource": {
224
+ "name": "train_RegexFeaturizer1",
225
+ "output_fingerprint": "8338decfe4044370959a64a7613a845a"
226
+ }
227
+ },
228
+ "run_LexicalSyntacticFeaturizer2": {
229
+ "needs": {
230
+ "messages": "run_RegexFeaturizer1"
231
+ },
232
+ "uses": "rasa.nlu.featurizers.sparse_featurizer.lexical_syntactic_featurizer.LexicalSyntacticFeaturizer",
233
+ "constructor_name": "load",
234
+ "fn": "process",
235
+ "config": {},
236
+ "eager": true,
237
+ "is_target": false,
238
+ "is_input": false,
239
+ "resource": {
240
+ "name": "train_LexicalSyntacticFeaturizer2",
241
+ "output_fingerprint": "0b51ac51c95d455f802036bc7d252c20"
242
+ }
243
+ },
244
+ "run_CountVectorsFeaturizer3": {
245
+ "needs": {
246
+ "messages": "run_LexicalSyntacticFeaturizer2"
247
+ },
248
+ "uses": "rasa.nlu.featurizers.sparse_featurizer.count_vectors_featurizer.CountVectorsFeaturizer",
249
+ "constructor_name": "load",
250
+ "fn": "process",
251
+ "config": {
252
+ "analyzer": "char_wb",
253
+ "max_ngram": 4,
254
+ "min_ngram": 1
255
+ },
256
+ "eager": true,
257
+ "is_target": false,
258
+ "is_input": false,
259
+ "resource": {
260
+ "name": "train_CountVectorsFeaturizer3",
261
+ "output_fingerprint": "c3c86c8a95994de6bc795e2a8096b064"
262
+ }
263
+ },
264
+ "run_DIETClassifier4": {
265
+ "needs": {
266
+ "messages": "run_CountVectorsFeaturizer3"
267
+ },
268
+ "uses": "rasa.nlu.classifiers.diet_classifier.DIETClassifier",
269
+ "constructor_name": "load",
270
+ "fn": "process",
271
+ "config": {
272
+ "constrain_similarities": true,
273
+ "entity_recognition": true,
274
+ "entity_recognition_confidence_threshold": 0.6,
275
+ "epochs": 150,
276
+ "intent_classification_confidence_threshold": 0.7,
277
+ "model_confidence": "softmax"
278
+ },
279
+ "eager": true,
280
+ "is_target": false,
281
+ "is_input": false,
282
+ "resource": {
283
+ "name": "train_DIETClassifier4",
284
+ "output_fingerprint": "00109f576b0f4e118a8460740a58446d"
285
+ }
286
+ },
287
+ "run_RegexMessageHandler": {
288
+ "needs": {
289
+ "messages": "run_DIETClassifier4"
290
+ },
291
+ "uses": "rasa.nlu.classifiers.regex_message_handler.RegexMessageHandler",
292
+ "constructor_name": "load",
293
+ "fn": "process",
294
+ "config": {},
295
+ "eager": true,
296
+ "is_target": false,
297
+ "is_input": false,
298
+ "resource": null
299
+ }
300
+ }
301
+ },
302
+ "training_type": 1,
303
+ "project_fingerprint": "b8131c59a6ba790d9cabcb5ae95657db0bbdb4aaea22cf21d67ce13983d73da2",
304
+ "core_target": null,
305
+ "nlu_target": "run_RegexMessageHandler",
306
+ "language": "en",
307
+ "spaces": null
308
+ }