hyoo14 commited on
Commit
0b23564
·
verified ·
1 Parent(s): 1ea96bf

Upload tokenizer

Browse files
Files changed (3) hide show
  1. special_tokens_map.json +23 -0
  2. tokenizer.json +3184 -0
  3. tokenizer_config.json +59 -0
special_tokens_map.json ADDED
@@ -0,0 +1,23 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ }
23
+ }
tokenizer.json ADDED
@@ -0,0 +1,3184 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "version": "1.0",
3
+ "truncation": {
4
+ "direction": "Right",
5
+ "max_length": 512,
6
+ "strategy": "LongestFirst",
7
+ "stride": 0
8
+ },
9
+ "padding": {
10
+ "strategy": "BatchLongest",
11
+ "direction": "Right",
12
+ "pad_to_multiple_of": null,
13
+ "pad_id": 0,
14
+ "pad_type_id": 0,
15
+ "pad_token": "[PAD]"
16
+ },
17
+ "added_tokens": [
18
+ {
19
+ "id": 0,
20
+ "content": "[PAD]",
21
+ "single_word": false,
22
+ "lstrip": false,
23
+ "rstrip": false,
24
+ "normalized": false,
25
+ "special": true
26
+ },
27
+ {
28
+ "id": 1,
29
+ "content": "[UNK]",
30
+ "single_word": false,
31
+ "lstrip": false,
32
+ "rstrip": false,
33
+ "normalized": false,
34
+ "special": true
35
+ },
36
+ {
37
+ "id": 2,
38
+ "content": "[CLS]",
39
+ "single_word": false,
40
+ "lstrip": false,
41
+ "rstrip": false,
42
+ "normalized": false,
43
+ "special": true
44
+ },
45
+ {
46
+ "id": 3,
47
+ "content": "[SEP]",
48
+ "single_word": false,
49
+ "lstrip": false,
50
+ "rstrip": false,
51
+ "normalized": false,
52
+ "special": true
53
+ },
54
+ {
55
+ "id": 4,
56
+ "content": "[MASK]",
57
+ "single_word": false,
58
+ "lstrip": false,
59
+ "rstrip": false,
60
+ "normalized": false,
61
+ "special": true
62
+ }
63
+ ],
64
+ "normalizer": null,
65
+ "pre_tokenizer": {
66
+ "type": "Whitespace"
67
+ },
68
+ "post_processor": {
69
+ "type": "TemplateProcessing",
70
+ "single": [
71
+ {
72
+ "SpecialToken": {
73
+ "id": "[CLS]",
74
+ "type_id": 0
75
+ }
76
+ },
77
+ {
78
+ "Sequence": {
79
+ "id": "A",
80
+ "type_id": 0
81
+ }
82
+ },
83
+ {
84
+ "SpecialToken": {
85
+ "id": "[SEP]",
86
+ "type_id": 0
87
+ }
88
+ }
89
+ ],
90
+ "pair": [
91
+ {
92
+ "SpecialToken": {
93
+ "id": "[CLS]",
94
+ "type_id": 0
95
+ }
96
+ },
97
+ {
98
+ "Sequence": {
99
+ "id": "A",
100
+ "type_id": 0
101
+ }
102
+ },
103
+ {
104
+ "SpecialToken": {
105
+ "id": "[SEP]",
106
+ "type_id": 0
107
+ }
108
+ },
109
+ {
110
+ "Sequence": {
111
+ "id": "B",
112
+ "type_id": 1
113
+ }
114
+ },
115
+ {
116
+ "SpecialToken": {
117
+ "id": "[SEP]",
118
+ "type_id": 1
119
+ }
120
+ }
121
+ ],
122
+ "special_tokens": {
123
+ "[CLS]": {
124
+ "id": "[CLS]",
125
+ "ids": [
126
+ 2
127
+ ],
128
+ "tokens": [
129
+ "[CLS]"
130
+ ]
131
+ },
132
+ "[SEP]": {
133
+ "id": "[SEP]",
134
+ "ids": [
135
+ 3
136
+ ],
137
+ "tokens": [
138
+ "[SEP]"
139
+ ]
140
+ }
141
+ }
142
+ },
143
+ "decoder": null,
144
+ "model": {
145
+ "type": "BPE",
146
+ "dropout": null,
147
+ "unk_token": "[UNK]",
148
+ "continuing_subword_prefix": null,
149
+ "end_of_word_suffix": null,
150
+ "fuse_unk": false,
151
+ "byte_fallback": false,
152
+ "ignore_merges": false,
153
+ "vocab": {
154
+ "[PAD]": 0,
155
+ "[UNK]": 1,
156
+ "[CLS]": 2,
157
+ "[SEP]": 3,
158
+ "[MASK]": 4,
159
+ "A": 5,
160
+ "C": 6,
161
+ "G": 7,
162
+ "T": 8,
163
+ "TT": 9,
164
+ "AA": 10,
165
+ "TG": 11,
166
+ "AG": 12,
167
+ "CC": 13,
168
+ "TC": 14,
169
+ "AC": 15,
170
+ "GG": 16,
171
+ "ATT": 17,
172
+ "AT": 18,
173
+ "ATG": 19,
174
+ "GC": 20,
175
+ "TAA": 21,
176
+ "TCC": 22,
177
+ "ACC": 23,
178
+ "AAAA": 24,
179
+ "AGG": 25,
180
+ "ATC": 26,
181
+ "AGC": 27,
182
+ "TTC": 28,
183
+ "AAG": 29,
184
+ "TTTT": 30,
185
+ "TGC": 31,
186
+ "TGG": 32,
187
+ "AAC": 33,
188
+ "TTG": 34,
189
+ "TAG": 35,
190
+ "TAC": 36,
191
+ "CCC": 37,
192
+ "TATT": 38,
193
+ "TGGG": 39,
194
+ "TAT": 40,
195
+ "AGAA": 41,
196
+ "AGGG": 42,
197
+ "TTTC": 43,
198
+ "AGGC": 44,
199
+ "AGCC": 45,
200
+ "ATAA": 46,
201
+ "TGTG": 47,
202
+ "TTGG": 48,
203
+ "ATTC": 49,
204
+ "AAGG": 50,
205
+ "ACAC": 51,
206
+ "TCCC": 52,
207
+ "TCTC": 53,
208
+ "TATG": 54,
209
+ "TTTG": 55,
210
+ "TTCC": 56,
211
+ "AGTG": 57,
212
+ "ATGG": 58,
213
+ "AGAC": 59,
214
+ "AAAC": 60,
215
+ "ACCC": 61,
216
+ "TGCC": 62,
217
+ "ATTG": 63,
218
+ "ATCC": 64,
219
+ "AGAG": 65,
220
+ "ATGC": 66,
221
+ "ATAC": 67,
222
+ "TCTG": 68,
223
+ "TTAA": 69,
224
+ "TCAC": 70,
225
+ "TGAA": 71,
226
+ "TGGC": 72,
227
+ "TTGC": 73,
228
+ "TAAG": 74,
229
+ "TATC": 75,
230
+ "TAAC": 76,
231
+ "AAAG": 77,
232
+ "TTAC": 78,
233
+ "AAGC": 79,
234
+ "GGG": 80,
235
+ "TAGC": 81,
236
+ "GGC": 82,
237
+ "ATAT": 83,
238
+ "TACC": 84,
239
+ "AACC": 85,
240
+ "AATG": 86,
241
+ "TAGG": 87,
242
+ "GCC": 88,
243
+ "ATATT": 89,
244
+ "AGTC": 90,
245
+ "TTTTC": 91,
246
+ "AAAAC": 92,
247
+ "TGAC": 93,
248
+ "TTTAA": 94,
249
+ "AAAAG": 95,
250
+ "AATC": 96,
251
+ "TGTC": 97,
252
+ "TTATT": 98,
253
+ "ATAG": 99,
254
+ "TGAG": 100,
255
+ "TTTTG": 101,
256
+ "AAATT": 102,
257
+ "AATT": 103,
258
+ "AATAA": 104,
259
+ "TTTCC": 105,
260
+ "ACAG": 106,
261
+ "TCAG": 107,
262
+ "AAATG": 108,
263
+ "TGGGC": 109,
264
+ "ACTC": 110,
265
+ "AGGCC": 111,
266
+ "TTAG": 112,
267
+ "ACTG": 113,
268
+ "ACG": 114,
269
+ "ATATG": 115,
270
+ "TGGCC": 116,
271
+ "ATTTC": 117,
272
+ "ACAA": 118,
273
+ "ATCTC": 119,
274
+ "TATTC": 120,
275
+ "TGTAA": 121,
276
+ "ACTT": 122,
277
+ "ATGCC": 123,
278
+ "TAAAA": 124,
279
+ "AAAAAAAA": 125,
280
+ "ATTCC": 126,
281
+ "TTTAG": 127,
282
+ "TCCCC": 128,
283
+ "TTTGC": 129,
284
+ "TTCCC": 130,
285
+ "TGGGG": 131,
286
+ "TTCTC": 132,
287
+ "ATAAAA": 133,
288
+ "AGAAG": 134,
289
+ "TTTTTTTT": 135,
290
+ "ACCCC": 136,
291
+ "AGGGC": 137,
292
+ "ACCTC": 138,
293
+ "AGATG": 139,
294
+ "ATTAC": 140,
295
+ "AAGCC": 141,
296
+ "GGCC": 142,
297
+ "AGGAG": 143,
298
+ "TCAA": 144,
299
+ "ATTGC": 145,
300
+ "TATTG": 146,
301
+ "ATAAC": 147,
302
+ "ATATC": 148,
303
+ "TTTAC": 149,
304
+ "ATGGC": 150,
305
+ "AAGGC": 151,
306
+ "ACCAC": 152,
307
+ "GTG": 153,
308
+ "ATCCC": 154,
309
+ "AGAAC": 155,
310
+ "ATTTT": 156,
311
+ "TTGCC": 157,
312
+ "AAATC": 158,
313
+ "ATAAG": 159,
314
+ "TTGGC": 160,
315
+ "TGGAG": 161,
316
+ "ATGGG": 162,
317
+ "AAAGC": 163,
318
+ "AGGGG": 164,
319
+ "ATCAC": 165,
320
+ "ATTTG": 166,
321
+ "AATTC": 167,
322
+ "TGCAC": 168,
323
+ "TTTGG": 169,
324
+ "TCG": 170,
325
+ "AGAGC": 171,
326
+ "AAAGG": 172,
327
+ "GGGC": 173,
328
+ "TTGGG": 174,
329
+ "AGAAAA": 175,
330
+ "TATCC": 176,
331
+ "TCTCC": 177,
332
+ "ATAGC": 178,
333
+ "TGAGG": 179,
334
+ "TTTATT": 180,
335
+ "AGTAA": 181,
336
+ "AGAGG": 182,
337
+ "TCTTC": 183,
338
+ "ACATT": 184,
339
+ "TCCTG": 185,
340
+ "AGCCC": 186,
341
+ "TATGC": 187,
342
+ "TTAAAA": 188,
343
+ "AGATT": 189,
344
+ "TTAAC": 190,
345
+ "GGGG": 191,
346
+ "AAGAC": 192,
347
+ "TCATT": 193,
348
+ "TTCTG": 194,
349
+ "AGACC": 195,
350
+ "AAGGG": 196,
351
+ "ATACC": 197,
352
+ "TTTAT": 198,
353
+ "AAGTG": 199,
354
+ "TTATG": 200,
355
+ "AAGAA": 201,
356
+ "TAGCC": 202,
357
+ "TTCAC": 203,
358
+ "AGGTG": 204,
359
+ "TTGAA": 205,
360
+ "ATCTG": 206,
361
+ "AGCAC": 207,
362
+ "TGCTG": 208,
363
+ "AAACC": 209,
364
+ "ATGTG": 210,
365
+ "TTTTCC": 211,
366
+ "AGTTC": 212,
367
+ "TCCTC": 213,
368
+ "TATGG": 214,
369
+ "AATAC": 215,
370
+ "AGTGG": 216,
371
+ "TAGGC": 217,
372
+ "AGCTC": 218,
373
+ "ATAGG": 219,
374
+ "TTATC": 220,
375
+ "TTAAG": 221,
376
+ "TACCC": 222,
377
+ "TTTTTG": 223,
378
+ "AACAC": 224,
379
+ "TGCTC": 225,
380
+ "AGATC": 226,
381
+ "TCCCAGC": 227,
382
+ "AGCTG": 228,
383
+ "AATAG": 229,
384
+ "TCTTG": 230,
385
+ "AGTGGC": 231,
386
+ "ATTGG": 232,
387
+ "TACTC": 233,
388
+ "TAAAC": 234,
389
+ "AATGG": 235,
390
+ "AGGTC": 236,
391
+ "AGGAC": 237,
392
+ "TTGTG": 238,
393
+ "TATAC": 239,
394
+ "ATTTTC": 240,
395
+ "ATATAA": 241,
396
+ "AGGCTG": 242,
397
+ "ATTTAA": 243,
398
+ "AGTT": 244,
399
+ "AGTAG": 245,
400
+ "ATGAC": 246,
401
+ "AATGC": 247,
402
+ "TCCAC": 248,
403
+ "CCCC": 249,
404
+ "ATGTC": 250,
405
+ "AACTC": 251,
406
+ "TTTTTC": 252,
407
+ "TAAGC": 253,
408
+ "AAGTC": 254,
409
+ "TGGTG": 255,
410
+ "TATAA": 256,
411
+ "AGTGC": 257,
412
+ "TAAGG": 258,
413
+ "ACCTG": 259,
414
+ "TTAGC": 260,
415
+ "AAATAA": 261,
416
+ "TGCCTC": 262,
417
+ "AATCC": 263,
418
+ "TTGGCC": 264,
419
+ "TAGGG": 265,
420
+ "TGGAC": 266,
421
+ "TTGTC": 267,
422
+ "AACCC": 268,
423
+ "TTACC": 269,
424
+ "TAACC": 270,
425
+ "AATTTT": 271,
426
+ "AAAGAA": 272,
427
+ "ATTATT": 273,
428
+ "AGCG": 274,
429
+ "AAAAAC": 275,
430
+ "TAATG": 276,
431
+ "TTGAC": 277,
432
+ "AGTCC": 278,
433
+ "AACTG": 279,
434
+ "AGTTG": 280,
435
+ "AATTG": 281,
436
+ "TCTGC": 282,
437
+ "TTAGG": 283,
438
+ "TACAC": 284,
439
+ "AGAAGG": 285,
440
+ "ATATTC": 286,
441
+ "AAAACC": 287,
442
+ "AAAAGC": 288,
443
+ "TGCCC": 289,
444
+ "ACTGC": 290,
445
+ "AGAAGC": 291,
446
+ "TAATAA": 292,
447
+ "AATATT": 293,
448
+ "ACCATG": 294,
449
+ "TGGTC": 295,
450
+ "TTTTGC": 296,
451
+ "AACG": 297,
452
+ "TACTG": 298,
453
+ "ACACACAC": 299,
454
+ "ATTTTG": 300,
455
+ "TCCG": 301,
456
+ "TGCG": 302,
457
+ "AAAATG": 303,
458
+ "ACATG": 304,
459
+ "TCAGC": 305,
460
+ "ATCG": 306,
461
+ "AGTAC": 307,
462
+ "TTTTGG": 308,
463
+ "AATAT": 309,
464
+ "AGAGAA": 310,
465
+ "TTCG": 311,
466
+ "TCCAGCC": 312,
467
+ "ATATAC": 313,
468
+ "TCACC": 314,
469
+ "AAAAGG": 315,
470
+ "TGTGTGTG": 316,
471
+ "TCATC": 317,
472
+ "TGCTGGG": 318,
473
+ "TGAAG": 319,
474
+ "TGTAG": 320,
475
+ "TGTGG": 321,
476
+ "AAAAATT": 322,
477
+ "ACTTC": 323,
478
+ "TTCCCC": 324,
479
+ "ATAGAA": 325,
480
+ "TTGCCC": 326,
481
+ "AGGAGG": 327,
482
+ "TTTCCC": 328,
483
+ "TATATT": 329,
484
+ "ACCG": 330,
485
+ "ACTAC": 331,
486
+ "TCACTGC": 332,
487
+ "GCG": 333,
488
+ "TTTGTG": 334,
489
+ "ACAGC": 335,
490
+ "TCATG": 336,
491
+ "AGTTTT": 337,
492
+ "AGGAA": 338,
493
+ "TTTATG": 339,
494
+ "ATATTG": 340,
495
+ "TGATG": 341,
496
+ "TCTAA": 342,
497
+ "TGTGC": 343,
498
+ "AGGAAG": 344,
499
+ "TTTGGG": 345,
500
+ "TGTTC": 346,
501
+ "AGCCCC": 347,
502
+ "AGTTTC": 348,
503
+ "AGGCTGG": 349,
504
+ "TTTGCC": 350,
505
+ "ATTTCC": 351,
506
+ "ATACAC": 352,
507
+ "AAAATAA": 353,
508
+ "TAGAC": 354,
509
+ "AGGAGAA": 355,
510
+ "TGAGC": 356,
511
+ "TGGAA": 357,
512
+ "TTTTTAA": 358,
513
+ "AGCCTCCC": 359,
514
+ "ATGAA": 360,
515
+ "TTTAAG": 361,
516
+ "TCTGG": 362,
517
+ "TTTATC": 363,
518
+ "TTATAA": 364,
519
+ "TGATT": 365,
520
+ "AACAA": 366,
521
+ "TAGCTGGG": 367,
522
+ "TCAAG": 368,
523
+ "AAAAAA": 369,
524
+ "ACTTTGGG": 370,
525
+ "TATTCC": 371,
526
+ "TCAGG": 372,
527
+ "AACAG": 373,
528
+ "TTCTTC": 374,
529
+ "TGTGGC": 375,
530
+ "ATATGC": 376,
531
+ "ATTACAGGC": 377,
532
+ "AGGGGC": 378,
533
+ "AGGGCC": 379,
534
+ "TTATTC": 380,
535
+ "ATATCC": 381,
536
+ "TGTAATCCCAGC": 382,
537
+ "TACG": 383,
538
+ "AGAAAC": 384,
539
+ "TGTCC": 385,
540
+ "AGATGG": 386,
541
+ "TGTGCC": 387,
542
+ "TTTCTC": 388,
543
+ "TGAAC": 389,
544
+ "AGTCTC": 390,
545
+ "TGTTG": 391,
546
+ "ATTTTTT": 392,
547
+ "AAGAAG": 393,
548
+ "TGGGGC": 394,
549
+ "AGCAGC": 395,
550
+ "GCCC": 396,
551
+ "TTTGGC": 397,
552
+ "AGGCTGAGGC": 398,
553
+ "TGGGCC": 399,
554
+ "TTCTCC": 400,
555
+ "TAGAA": 401,
556
+ "TGGAGTGC": 402,
557
+ "ATTAA": 403,
558
+ "AGTGCC": 404,
559
+ "TGTCTC": 405,
560
+ "ATATGG": 406,
561
+ "ACATC": 407,
562
+ "TGGGGG": 408,
563
+ "TGACC": 409,
564
+ "ACTCC": 410,
565
+ "TAAAAC": 411,
566
+ "AGATAA": 412,
567
+ "TAATTTT": 413,
568
+ "TCAAC": 414,
569
+ "TCTAC": 415,
570
+ "TCTAG": 416,
571
+ "GAG": 417,
572
+ "TAAATG": 418,
573
+ "AGCAA": 419,
574
+ "TATATG": 420,
575
+ "ATATATAT": 421,
576
+ "ATTTGC": 422,
577
+ "TCCTCC": 423,
578
+ "CCCAC": 424,
579
+ "ATTTATT": 425,
580
+ "TCTGCC": 426,
581
+ "ATGGCC": 427,
582
+ "TCGC": 428,
583
+ "AGTATT": 429,
584
+ "AGAACC": 430,
585
+ "TTAAAC": 431,
586
+ "AAATTC": 432,
587
+ "AGAGAC": 433,
588
+ "ATTTAC": 434,
589
+ "ATTGCC": 435,
590
+ "AACAAC": 436,
591
+ "TTTAAC": 437,
592
+ "ACGG": 438,
593
+ "AAGAAAA": 439,
594
+ "TCTGGC": 440,
595
+ "ATTCTCC": 441,
596
+ "AGGTGG": 442,
597
+ "TGCTGC": 443,
598
+ "TTCAAG": 444,
599
+ "AGAGGG": 445,
600
+ "ACACC": 446,
601
+ "TCTTTT": 447,
602
+ "AGAGGC": 448,
603
+ "ATCACC": 449,
604
+ "TAAATT": 450,
605
+ "AAGGCC": 451,
606
+ "TTGCAGTG": 452,
607
+ "TGTAC": 453,
608
+ "AATTTC": 454,
609
+ "ATCCCC": 455,
610
+ "ACAAG": 456,
611
+ "ACAGG": 457,
612
+ "ACAAC": 458,
613
+ "TGCCCC": 459,
614
+ "AGATTC": 460,
615
+ "TTAGAA": 461,
616
+ "TTGGGG": 462,
617
+ "AGACAC": 463,
618
+ "TGGAAG": 464,
619
+ "ACCTCC": 465,
620
+ "ATGGGG": 466,
621
+ "AGCCTCC": 467,
622
+ "TTATTG": 468,
623
+ "TAAAAG": 469,
624
+ "ATCTTC": 470,
625
+ "ATCTCC": 471,
626
+ "TGAAGC": 472,
627
+ "TAATC": 473,
628
+ "AAATGC": 474,
629
+ "TTGTTG": 475,
630
+ "ATTCCC": 476,
631
+ "TACTAAAA": 477,
632
+ "ATAGTG": 478,
633
+ "AAATAC": 479,
634
+ "TTGGGC": 480,
635
+ "TAGAGAC": 481,
636
+ "TGTTTT": 482,
637
+ "TTCTGC": 483,
638
+ "TGGCCC": 484,
639
+ "TCTGTC": 485,
640
+ "AGCTCC": 486,
641
+ "AACTCC": 487,
642
+ "TTAGCC": 488,
643
+ "AAAGTGCTGGG": 489,
644
+ "ATAGAC": 490,
645
+ "TATTTTTAG": 491,
646
+ "ACTTG": 492,
647
+ "ACCACC": 493,
648
+ "AAACAC": 494,
649
+ "GTGG": 495,
650
+ "ATTTAG": 496,
651
+ "AGGAGC": 497,
652
+ "AGGCTGGAGTGC": 498,
653
+ "ATACCC": 499,
654
+ "ATGTAA": 500,
655
+ "ACGC": 501,
656
+ "AGTAT": 502,
657
+ "TTTACC": 503,
658
+ "ACTAA": 504,
659
+ "AGGCCC": 505,
660
+ "AAGGGG": 506,
661
+ "TCTCG": 507,
662
+ "ATGAAG": 508,
663
+ "AAAGAC": 509,
664
+ "TGAAAA": 510,
665
+ "AAGGGC": 511,
666
+ "ATAGGC": 512,
667
+ "AGAGTG": 513,
668
+ "AGCTGC": 514,
669
+ "ATGTTC": 515,
670
+ "TATTTC": 516,
671
+ "TGATC": 517,
672
+ "AGTTTG": 518,
673
+ "AGCTAA": 519,
674
+ "AGAGCC": 520,
675
+ "TGCTTC": 521,
676
+ "ATCATC": 522,
677
+ "AACATGG": 523,
678
+ "AGCTTC": 524,
679
+ "AAGAAC": 525,
680
+ "TTTTTTG": 526,
681
+ "AGGGGG": 527,
682
+ "ATAAGC": 528,
683
+ "TAAGCC": 529,
684
+ "ACTGG": 530,
685
+ "ACAAAA": 531,
686
+ "ATCATT": 532,
687
+ "TCTTTC": 533,
688
+ "ATGATG": 534,
689
+ "TGCAA": 535,
690
+ "AGGTTC": 536,
691
+ "AACATT": 537,
692
+ "ATGGGC": 538,
693
+ "ATAGAG": 539,
694
+ "AAATGG": 540,
695
+ "AGTTCC": 541,
696
+ "TTTAGC": 542,
697
+ "AACTTC": 543,
698
+ "AGCAAG": 544,
699
+ "ATAAAAC": 545,
700
+ "AAAATC": 546,
701
+ "AGCCAC": 547,
702
+ "AGGAAC": 548,
703
+ "TTAACC": 549,
704
+ "TATTTATT": 550,
705
+ "TTTCTG": 551,
706
+ "ATAAGG": 552,
707
+ "AGCCACC": 553,
708
+ "AGATGC": 554,
709
+ "TTAAGC": 555,
710
+ "TTGTAA": 556,
711
+ "AGTGTG": 557,
712
+ "AACCCC": 558,
713
+ "TTCATT": 559,
714
+ "ATCATG": 560,
715
+ "AATGAA": 561,
716
+ "AGGTGC": 562,
717
+ "AAAAAAAAAAAAAAAA": 563,
718
+ "AGGATG": 564,
719
+ "AGCCG": 565,
720
+ "TGGTGG": 566,
721
+ "AGTGGG": 567,
722
+ "TGCACTCCAGCC": 568,
723
+ "TATTGC": 569,
724
+ "TAGTC": 570,
725
+ "CCCG": 571,
726
+ "AAGTAA": 572,
727
+ "TAGTG": 573,
728
+ "TTTTTTTTTTTTTTTT": 574,
729
+ "AGCATT": 575,
730
+ "ATCTGC": 576,
731
+ "TCTCAC": 577,
732
+ "AAATTG": 578,
733
+ "TTTAGG": 579,
734
+ "AGACCC": 580,
735
+ "GGGCC": 581,
736
+ "TCCTTC": 582,
737
+ "ATAGGG": 583,
738
+ "AATATG": 584,
739
+ "TTATAC": 585,
740
+ "TAGAAG": 586,
741
+ "AAAGTG": 587,
742
+ "AAATCC": 588,
743
+ "TTCCTC": 589,
744
+ "TTTCAC": 590,
745
+ "AGTATG": 591,
746
+ "TACTAAAAATAC": 592,
747
+ "ATGTGC": 593,
748
+ "AGGAGGC": 594,
749
+ "TATATC": 595,
750
+ "TTCTAA": 596,
751
+ "TGAGGC": 597,
752
+ "ACACAC": 598,
753
+ "TCCCCC": 599,
754
+ "AACATC": 600,
755
+ "AAGCG": 601,
756
+ "AATGGC": 602,
757
+ "ACCCCC": 603,
758
+ "AGATAC": 604,
759
+ "ATAAAAG": 605,
760
+ "ATGATT": 606,
761
+ "TGGAGG": 607,
762
+ "AGTTAA": 608,
763
+ "": 609
764
+ },
765
+ "merges": [
766
+ [
767
+ "A",
768
+ ""
769
+ ],
770
+ [
771
+ "C",
772
+ ""
773
+ ],
774
+ [
775
+ "G",
776
+ ""
777
+ ],
778
+ [
779
+ "T",
780
+ ""
781
+ ],
782
+ [
783
+ "T",
784
+ "T"
785
+ ],
786
+ [
787
+ "A",
788
+ "A"
789
+ ],
790
+ [
791
+ "T",
792
+ "G"
793
+ ],
794
+ [
795
+ "A",
796
+ "G"
797
+ ],
798
+ [
799
+ "C",
800
+ "C"
801
+ ],
802
+ [
803
+ "T",
804
+ "C"
805
+ ],
806
+ [
807
+ "A",
808
+ "C"
809
+ ],
810
+ [
811
+ "G",
812
+ "G"
813
+ ],
814
+ [
815
+ "A",
816
+ "TT"
817
+ ],
818
+ [
819
+ "A",
820
+ "T"
821
+ ],
822
+ [
823
+ "A",
824
+ "TG"
825
+ ],
826
+ [
827
+ "G",
828
+ "C"
829
+ ],
830
+ [
831
+ "T",
832
+ "AA"
833
+ ],
834
+ [
835
+ "T",
836
+ "CC"
837
+ ],
838
+ [
839
+ "A",
840
+ "CC"
841
+ ],
842
+ [
843
+ "AA",
844
+ "AA"
845
+ ],
846
+ [
847
+ "AG",
848
+ "G"
849
+ ],
850
+ [
851
+ "A",
852
+ "TC"
853
+ ],
854
+ [
855
+ "AG",
856
+ "C"
857
+ ],
858
+ [
859
+ "TT",
860
+ "C"
861
+ ],
862
+ [
863
+ "AA",
864
+ "G"
865
+ ],
866
+ [
867
+ "TT",
868
+ "TT"
869
+ ],
870
+ [
871
+ "TG",
872
+ "C"
873
+ ],
874
+ [
875
+ "TG",
876
+ "G"
877
+ ],
878
+ [
879
+ "AA",
880
+ "C"
881
+ ],
882
+ [
883
+ "TT",
884
+ "G"
885
+ ],
886
+ [
887
+ "T",
888
+ "AG"
889
+ ],
890
+ [
891
+ "T",
892
+ "AC"
893
+ ],
894
+ [
895
+ "CC",
896
+ "C"
897
+ ],
898
+ [
899
+ "T",
900
+ "ATT"
901
+ ],
902
+ [
903
+ "TG",
904
+ "GG"
905
+ ],
906
+ [
907
+ "T",
908
+ "AT"
909
+ ],
910
+ [
911
+ "AG",
912
+ "AA"
913
+ ],
914
+ [
915
+ "AG",
916
+ "GG"
917
+ ],
918
+ [
919
+ "TT",
920
+ "TC"
921
+ ],
922
+ [
923
+ "AG",
924
+ "GC"
925
+ ],
926
+ [
927
+ "AG",
928
+ "CC"
929
+ ],
930
+ [
931
+ "AT",
932
+ "AA"
933
+ ],
934
+ [
935
+ "TG",
936
+ "TG"
937
+ ],
938
+ [
939
+ "TT",
940
+ "GG"
941
+ ],
942
+ [
943
+ "ATT",
944
+ "C"
945
+ ],
946
+ [
947
+ "AA",
948
+ "GG"
949
+ ],
950
+ [
951
+ "AC",
952
+ "AC"
953
+ ],
954
+ [
955
+ "TCC",
956
+ "C"
957
+ ],
958
+ [
959
+ "TC",
960
+ "TC"
961
+ ],
962
+ [
963
+ "T",
964
+ "ATG"
965
+ ],
966
+ [
967
+ "TT",
968
+ "TG"
969
+ ],
970
+ [
971
+ "TT",
972
+ "CC"
973
+ ],
974
+ [
975
+ "AG",
976
+ "TG"
977
+ ],
978
+ [
979
+ "ATG",
980
+ "G"
981
+ ],
982
+ [
983
+ "AG",
984
+ "AC"
985
+ ],
986
+ [
987
+ "AA",
988
+ "AC"
989
+ ],
990
+ [
991
+ "ACC",
992
+ "C"
993
+ ],
994
+ [
995
+ "TG",
996
+ "CC"
997
+ ],
998
+ [
999
+ "ATT",
1000
+ "G"
1001
+ ],
1002
+ [
1003
+ "AT",
1004
+ "CC"
1005
+ ],
1006
+ [
1007
+ "AG",
1008
+ "AG"
1009
+ ],
1010
+ [
1011
+ "ATG",
1012
+ "C"
1013
+ ],
1014
+ [
1015
+ "AT",
1016
+ "AC"
1017
+ ],
1018
+ [
1019
+ "TC",
1020
+ "TG"
1021
+ ],
1022
+ [
1023
+ "TT",
1024
+ "AA"
1025
+ ],
1026
+ [
1027
+ "TC",
1028
+ "AC"
1029
+ ],
1030
+ [
1031
+ "TG",
1032
+ "AA"
1033
+ ],
1034
+ [
1035
+ "TG",
1036
+ "GC"
1037
+ ],
1038
+ [
1039
+ "TT",
1040
+ "GC"
1041
+ ],
1042
+ [
1043
+ "TAA",
1044
+ "G"
1045
+ ],
1046
+ [
1047
+ "T",
1048
+ "ATC"
1049
+ ],
1050
+ [
1051
+ "TAA",
1052
+ "C"
1053
+ ],
1054
+ [
1055
+ "AA",
1056
+ "AG"
1057
+ ],
1058
+ [
1059
+ "TT",
1060
+ "AC"
1061
+ ],
1062
+ [
1063
+ "AA",
1064
+ "GC"
1065
+ ],
1066
+ [
1067
+ "GG",
1068
+ "G"
1069
+ ],
1070
+ [
1071
+ "T",
1072
+ "AGC"
1073
+ ],
1074
+ [
1075
+ "GG",
1076
+ "C"
1077
+ ],
1078
+ [
1079
+ "AT",
1080
+ "AT"
1081
+ ],
1082
+ [
1083
+ "T",
1084
+ "ACC"
1085
+ ],
1086
+ [
1087
+ "AA",
1088
+ "CC"
1089
+ ],
1090
+ [
1091
+ "AA",
1092
+ "TG"
1093
+ ],
1094
+ [
1095
+ "T",
1096
+ "AGG"
1097
+ ],
1098
+ [
1099
+ "G",
1100
+ "CC"
1101
+ ],
1102
+ [
1103
+ "AT",
1104
+ "ATT"
1105
+ ],
1106
+ [
1107
+ "AG",
1108
+ "TC"
1109
+ ],
1110
+ [
1111
+ "TT",
1112
+ "TTC"
1113
+ ],
1114
+ [
1115
+ "AAAA",
1116
+ "C"
1117
+ ],
1118
+ [
1119
+ "TG",
1120
+ "AC"
1121
+ ],
1122
+ [
1123
+ "TT",
1124
+ "TAA"
1125
+ ],
1126
+ [
1127
+ "AAAA",
1128
+ "G"
1129
+ ],
1130
+ [
1131
+ "AA",
1132
+ "TC"
1133
+ ],
1134
+ [
1135
+ "TG",
1136
+ "TC"
1137
+ ],
1138
+ [
1139
+ "TT",
1140
+ "ATT"
1141
+ ],
1142
+ [
1143
+ "AT",
1144
+ "AG"
1145
+ ],
1146
+ [
1147
+ "TG",
1148
+ "AG"
1149
+ ],
1150
+ [
1151
+ "TTTT",
1152
+ "G"
1153
+ ],
1154
+ [
1155
+ "AA",
1156
+ "ATT"
1157
+ ],
1158
+ [
1159
+ "AA",
1160
+ "TT"
1161
+ ],
1162
+ [
1163
+ "AA",
1164
+ "TAA"
1165
+ ],
1166
+ [
1167
+ "TT",
1168
+ "TCC"
1169
+ ],
1170
+ [
1171
+ "AC",
1172
+ "AG"
1173
+ ],
1174
+ [
1175
+ "TC",
1176
+ "AG"
1177
+ ],
1178
+ [
1179
+ "AA",
1180
+ "ATG"
1181
+ ],
1182
+ [
1183
+ "TGGG",
1184
+ "C"
1185
+ ],
1186
+ [
1187
+ "AC",
1188
+ "TC"
1189
+ ],
1190
+ [
1191
+ "AGG",
1192
+ "CC"
1193
+ ],
1194
+ [
1195
+ "TT",
1196
+ "AG"
1197
+ ],
1198
+ [
1199
+ "AC",
1200
+ "TG"
1201
+ ],
1202
+ [
1203
+ "AC",
1204
+ "G"
1205
+ ],
1206
+ [
1207
+ "AT",
1208
+ "ATG"
1209
+ ],
1210
+ [
1211
+ "TGG",
1212
+ "CC"
1213
+ ],
1214
+ [
1215
+ "ATT",
1216
+ "TC"
1217
+ ],
1218
+ [
1219
+ "AC",
1220
+ "AA"
1221
+ ],
1222
+ [
1223
+ "ATC",
1224
+ "TC"
1225
+ ],
1226
+ [
1227
+ "TATT",
1228
+ "C"
1229
+ ],
1230
+ [
1231
+ "TG",
1232
+ "TAA"
1233
+ ],
1234
+ [
1235
+ "AC",
1236
+ "TT"
1237
+ ],
1238
+ [
1239
+ "ATG",
1240
+ "CC"
1241
+ ],
1242
+ [
1243
+ "TAA",
1244
+ "AA"
1245
+ ],
1246
+ [
1247
+ "AAAA",
1248
+ "AAAA"
1249
+ ],
1250
+ [
1251
+ "ATT",
1252
+ "CC"
1253
+ ],
1254
+ [
1255
+ "TT",
1256
+ "TAG"
1257
+ ],
1258
+ [
1259
+ "TCC",
1260
+ "CC"
1261
+ ],
1262
+ [
1263
+ "TT",
1264
+ "TGC"
1265
+ ],
1266
+ [
1267
+ "TT",
1268
+ "CCC"
1269
+ ],
1270
+ [
1271
+ "TGGG",
1272
+ "G"
1273
+ ],
1274
+ [
1275
+ "TTC",
1276
+ "TC"
1277
+ ],
1278
+ [
1279
+ "AT",
1280
+ "AAAA"
1281
+ ],
1282
+ [
1283
+ "AG",
1284
+ "AAG"
1285
+ ],
1286
+ [
1287
+ "TTTT",
1288
+ "TTTT"
1289
+ ],
1290
+ [
1291
+ "ACC",
1292
+ "CC"
1293
+ ],
1294
+ [
1295
+ "AGGG",
1296
+ "C"
1297
+ ],
1298
+ [
1299
+ "ACC",
1300
+ "TC"
1301
+ ],
1302
+ [
1303
+ "AG",
1304
+ "ATG"
1305
+ ],
1306
+ [
1307
+ "ATT",
1308
+ "AC"
1309
+ ],
1310
+ [
1311
+ "AAG",
1312
+ "CC"
1313
+ ],
1314
+ [
1315
+ "GG",
1316
+ "CC"
1317
+ ],
1318
+ [
1319
+ "AGG",
1320
+ "AG"
1321
+ ],
1322
+ [
1323
+ "TC",
1324
+ "AA"
1325
+ ],
1326
+ [
1327
+ "ATT",
1328
+ "GC"
1329
+ ],
1330
+ [
1331
+ "TATT",
1332
+ "G"
1333
+ ],
1334
+ [
1335
+ "AT",
1336
+ "AAC"
1337
+ ],
1338
+ [
1339
+ "AT",
1340
+ "ATC"
1341
+ ],
1342
+ [
1343
+ "TT",
1344
+ "TAC"
1345
+ ],
1346
+ [
1347
+ "ATG",
1348
+ "GC"
1349
+ ],
1350
+ [
1351
+ "AAGG",
1352
+ "C"
1353
+ ],
1354
+ [
1355
+ "ACC",
1356
+ "AC"
1357
+ ],
1358
+ [
1359
+ "G",
1360
+ "TG"
1361
+ ],
1362
+ [
1363
+ "AT",
1364
+ "CCC"
1365
+ ],
1366
+ [
1367
+ "AG",
1368
+ "AAC"
1369
+ ],
1370
+ [
1371
+ "ATT",
1372
+ "TT"
1373
+ ],
1374
+ [
1375
+ "TTG",
1376
+ "CC"
1377
+ ],
1378
+ [
1379
+ "AA",
1380
+ "ATC"
1381
+ ],
1382
+ [
1383
+ "AT",
1384
+ "AAG"
1385
+ ],
1386
+ [
1387
+ "TTGG",
1388
+ "C"
1389
+ ],
1390
+ [
1391
+ "TGG",
1392
+ "AG"
1393
+ ],
1394
+ [
1395
+ "ATG",
1396
+ "GG"
1397
+ ],
1398
+ [
1399
+ "AA",
1400
+ "AGC"
1401
+ ],
1402
+ [
1403
+ "AGGG",
1404
+ "G"
1405
+ ],
1406
+ [
1407
+ "ATC",
1408
+ "AC"
1409
+ ],
1410
+ [
1411
+ "ATT",
1412
+ "TG"
1413
+ ],
1414
+ [
1415
+ "AA",
1416
+ "TTC"
1417
+ ],
1418
+ [
1419
+ "TGC",
1420
+ "AC"
1421
+ ],
1422
+ [
1423
+ "TT",
1424
+ "TGG"
1425
+ ],
1426
+ [
1427
+ "TC",
1428
+ "G"
1429
+ ],
1430
+ [
1431
+ "AG",
1432
+ "AGC"
1433
+ ],
1434
+ [
1435
+ "AA",
1436
+ "AGG"
1437
+ ],
1438
+ [
1439
+ "GG",
1440
+ "GC"
1441
+ ],
1442
+ [
1443
+ "TTGG",
1444
+ "G"
1445
+ ],
1446
+ [
1447
+ "AG",
1448
+ "AAAA"
1449
+ ],
1450
+ [
1451
+ "TAT",
1452
+ "CC"
1453
+ ],
1454
+ [
1455
+ "TC",
1456
+ "TCC"
1457
+ ],
1458
+ [
1459
+ "AT",
1460
+ "AGC"
1461
+ ],
1462
+ [
1463
+ "TG",
1464
+ "AGG"
1465
+ ],
1466
+ [
1467
+ "TT",
1468
+ "TATT"
1469
+ ],
1470
+ [
1471
+ "AG",
1472
+ "TAA"
1473
+ ],
1474
+ [
1475
+ "AG",
1476
+ "AGG"
1477
+ ],
1478
+ [
1479
+ "TC",
1480
+ "TTC"
1481
+ ],
1482
+ [
1483
+ "AC",
1484
+ "ATT"
1485
+ ],
1486
+ [
1487
+ "TCC",
1488
+ "TG"
1489
+ ],
1490
+ [
1491
+ "AG",
1492
+ "CCC"
1493
+ ],
1494
+ [
1495
+ "TATG",
1496
+ "C"
1497
+ ],
1498
+ [
1499
+ "TT",
1500
+ "AAAA"
1501
+ ],
1502
+ [
1503
+ "AG",
1504
+ "ATT"
1505
+ ],
1506
+ [
1507
+ "TT",
1508
+ "AAC"
1509
+ ],
1510
+ [
1511
+ "GG",
1512
+ "GG"
1513
+ ],
1514
+ [
1515
+ "AAG",
1516
+ "AC"
1517
+ ],
1518
+ [
1519
+ "TC",
1520
+ "ATT"
1521
+ ],
1522
+ [
1523
+ "TTC",
1524
+ "TG"
1525
+ ],
1526
+ [
1527
+ "AG",
1528
+ "ACC"
1529
+ ],
1530
+ [
1531
+ "AAGG",
1532
+ "G"
1533
+ ],
1534
+ [
1535
+ "AT",
1536
+ "ACC"
1537
+ ],
1538
+ [
1539
+ "TT",
1540
+ "TAT"
1541
+ ],
1542
+ [
1543
+ "AAG",
1544
+ "TG"
1545
+ ],
1546
+ [
1547
+ "TT",
1548
+ "ATG"
1549
+ ],
1550
+ [
1551
+ "AAG",
1552
+ "AA"
1553
+ ],
1554
+ [
1555
+ "TAG",
1556
+ "CC"
1557
+ ],
1558
+ [
1559
+ "TTC",
1560
+ "AC"
1561
+ ],
1562
+ [
1563
+ "AGG",
1564
+ "TG"
1565
+ ],
1566
+ [
1567
+ "TTG",
1568
+ "AA"
1569
+ ],
1570
+ [
1571
+ "ATC",
1572
+ "TG"
1573
+ ],
1574
+ [
1575
+ "AGC",
1576
+ "AC"
1577
+ ],
1578
+ [
1579
+ "TGC",
1580
+ "TG"
1581
+ ],
1582
+ [
1583
+ "AA",
1584
+ "ACC"
1585
+ ],
1586
+ [
1587
+ "ATG",
1588
+ "TG"
1589
+ ],
1590
+ [
1591
+ "TTTT",
1592
+ "CC"
1593
+ ],
1594
+ [
1595
+ "AG",
1596
+ "TTC"
1597
+ ],
1598
+ [
1599
+ "TCC",
1600
+ "TC"
1601
+ ],
1602
+ [
1603
+ "TATG",
1604
+ "G"
1605
+ ],
1606
+ [
1607
+ "AA",
1608
+ "TAC"
1609
+ ],
1610
+ [
1611
+ "AG",
1612
+ "TGG"
1613
+ ],
1614
+ [
1615
+ "TAG",
1616
+ "GC"
1617
+ ],
1618
+ [
1619
+ "AGC",
1620
+ "TC"
1621
+ ],
1622
+ [
1623
+ "AT",
1624
+ "AGG"
1625
+ ],
1626
+ [
1627
+ "TT",
1628
+ "ATC"
1629
+ ],
1630
+ [
1631
+ "TT",
1632
+ "AAG"
1633
+ ],
1634
+ [
1635
+ "T",
1636
+ "ACCC"
1637
+ ],
1638
+ [
1639
+ "TTTT",
1640
+ "TG"
1641
+ ],
1642
+ [
1643
+ "AAC",
1644
+ "AC"
1645
+ ],
1646
+ [
1647
+ "TGC",
1648
+ "TC"
1649
+ ],
1650
+ [
1651
+ "AG",
1652
+ "ATC"
1653
+ ],
1654
+ [
1655
+ "TCCC",
1656
+ "AGC"
1657
+ ],
1658
+ [
1659
+ "AGC",
1660
+ "TG"
1661
+ ],
1662
+ [
1663
+ "AA",
1664
+ "TAG"
1665
+ ],
1666
+ [
1667
+ "TC",
1668
+ "TTG"
1669
+ ],
1670
+ [
1671
+ "AGTG",
1672
+ "GC"
1673
+ ],
1674
+ [
1675
+ "ATT",
1676
+ "GG"
1677
+ ],
1678
+ [
1679
+ "TAC",
1680
+ "TC"
1681
+ ],
1682
+ [
1683
+ "TAA",
1684
+ "AC"
1685
+ ],
1686
+ [
1687
+ "AA",
1688
+ "TGG"
1689
+ ],
1690
+ [
1691
+ "AGG",
1692
+ "TC"
1693
+ ],
1694
+ [
1695
+ "AGG",
1696
+ "AC"
1697
+ ],
1698
+ [
1699
+ "TTG",
1700
+ "TG"
1701
+ ],
1702
+ [
1703
+ "TAT",
1704
+ "AC"
1705
+ ],
1706
+ [
1707
+ "ATT",
1708
+ "TTC"
1709
+ ],
1710
+ [
1711
+ "AT",
1712
+ "ATAA"
1713
+ ],
1714
+ [
1715
+ "AGGC",
1716
+ "TG"
1717
+ ],
1718
+ [
1719
+ "ATT",
1720
+ "TAA"
1721
+ ],
1722
+ [
1723
+ "AG",
1724
+ "TT"
1725
+ ],
1726
+ [
1727
+ "AG",
1728
+ "TAG"
1729
+ ],
1730
+ [
1731
+ "ATG",
1732
+ "AC"
1733
+ ],
1734
+ [
1735
+ "AA",
1736
+ "TGC"
1737
+ ],
1738
+ [
1739
+ "TCC",
1740
+ "AC"
1741
+ ],
1742
+ [
1743
+ "CC",
1744
+ "CC"
1745
+ ],
1746
+ [
1747
+ "ATG",
1748
+ "TC"
1749
+ ],
1750
+ [
1751
+ "AAC",
1752
+ "TC"
1753
+ ],
1754
+ [
1755
+ "TTTT",
1756
+ "TC"
1757
+ ],
1758
+ [
1759
+ "TAA",
1760
+ "GC"
1761
+ ],
1762
+ [
1763
+ "AAG",
1764
+ "TC"
1765
+ ],
1766
+ [
1767
+ "TGG",
1768
+ "TG"
1769
+ ],
1770
+ [
1771
+ "TAT",
1772
+ "AA"
1773
+ ],
1774
+ [
1775
+ "AG",
1776
+ "TGC"
1777
+ ],
1778
+ [
1779
+ "TAA",
1780
+ "GG"
1781
+ ],
1782
+ [
1783
+ "ACC",
1784
+ "TG"
1785
+ ],
1786
+ [
1787
+ "TT",
1788
+ "AGC"
1789
+ ],
1790
+ [
1791
+ "AA",
1792
+ "ATAA"
1793
+ ],
1794
+ [
1795
+ "TGCC",
1796
+ "TC"
1797
+ ],
1798
+ [
1799
+ "AA",
1800
+ "TCC"
1801
+ ],
1802
+ [
1803
+ "TTGG",
1804
+ "CC"
1805
+ ],
1806
+ [
1807
+ "TAG",
1808
+ "GG"
1809
+ ],
1810
+ [
1811
+ "TGG",
1812
+ "AC"
1813
+ ],
1814
+ [
1815
+ "TTG",
1816
+ "TC"
1817
+ ],
1818
+ [
1819
+ "AA",
1820
+ "CCC"
1821
+ ],
1822
+ [
1823
+ "TT",
1824
+ "ACC"
1825
+ ],
1826
+ [
1827
+ "TAA",
1828
+ "CC"
1829
+ ],
1830
+ [
1831
+ "AA",
1832
+ "TTTT"
1833
+ ],
1834
+ [
1835
+ "AA",
1836
+ "AGAA"
1837
+ ],
1838
+ [
1839
+ "ATT",
1840
+ "ATT"
1841
+ ],
1842
+ [
1843
+ "AGC",
1844
+ "G"
1845
+ ],
1846
+ [
1847
+ "AAAA",
1848
+ "AC"
1849
+ ],
1850
+ [
1851
+ "TAA",
1852
+ "TG"
1853
+ ],
1854
+ [
1855
+ "TTG",
1856
+ "AC"
1857
+ ],
1858
+ [
1859
+ "AG",
1860
+ "TCC"
1861
+ ],
1862
+ [
1863
+ "AAC",
1864
+ "TG"
1865
+ ],
1866
+ [
1867
+ "AG",
1868
+ "TTG"
1869
+ ],
1870
+ [
1871
+ "AA",
1872
+ "TTG"
1873
+ ],
1874
+ [
1875
+ "TC",
1876
+ "TGC"
1877
+ ],
1878
+ [
1879
+ "TT",
1880
+ "AGG"
1881
+ ],
1882
+ [
1883
+ "TAC",
1884
+ "AC"
1885
+ ],
1886
+ [
1887
+ "AGAA",
1888
+ "GG"
1889
+ ],
1890
+ [
1891
+ "AT",
1892
+ "ATTC"
1893
+ ],
1894
+ [
1895
+ "AAAA",
1896
+ "CC"
1897
+ ],
1898
+ [
1899
+ "AAAA",
1900
+ "GC"
1901
+ ],
1902
+ [
1903
+ "TG",
1904
+ "CCC"
1905
+ ],
1906
+ [
1907
+ "AC",
1908
+ "TGC"
1909
+ ],
1910
+ [
1911
+ "AGAA",
1912
+ "GC"
1913
+ ],
1914
+ [
1915
+ "TAA",
1916
+ "TAA"
1917
+ ],
1918
+ [
1919
+ "AA",
1920
+ "TATT"
1921
+ ],
1922
+ [
1923
+ "ACC",
1924
+ "ATG"
1925
+ ],
1926
+ [
1927
+ "TGG",
1928
+ "TC"
1929
+ ],
1930
+ [
1931
+ "TTTT",
1932
+ "GC"
1933
+ ],
1934
+ [
1935
+ "AAC",
1936
+ "G"
1937
+ ],
1938
+ [
1939
+ "TAC",
1940
+ "TG"
1941
+ ],
1942
+ [
1943
+ "ACAC",
1944
+ "ACAC"
1945
+ ],
1946
+ [
1947
+ "ATT",
1948
+ "TTG"
1949
+ ],
1950
+ [
1951
+ "TCC",
1952
+ "G"
1953
+ ],
1954
+ [
1955
+ "TGC",
1956
+ "G"
1957
+ ],
1958
+ [
1959
+ "AAAA",
1960
+ "TG"
1961
+ ],
1962
+ [
1963
+ "AC",
1964
+ "ATG"
1965
+ ],
1966
+ [
1967
+ "TC",
1968
+ "AGC"
1969
+ ],
1970
+ [
1971
+ "ATC",
1972
+ "G"
1973
+ ],
1974
+ [
1975
+ "AG",
1976
+ "TAC"
1977
+ ],
1978
+ [
1979
+ "TTTT",
1980
+ "GG"
1981
+ ],
1982
+ [
1983
+ "AA",
1984
+ "TAT"
1985
+ ],
1986
+ [
1987
+ "AG",
1988
+ "AGAA"
1989
+ ],
1990
+ [
1991
+ "TTC",
1992
+ "G"
1993
+ ],
1994
+ [
1995
+ "TCC",
1996
+ "AGCC"
1997
+ ],
1998
+ [
1999
+ "AT",
2000
+ "ATAC"
2001
+ ],
2002
+ [
2003
+ "TC",
2004
+ "ACC"
2005
+ ],
2006
+ [
2007
+ "AAAA",
2008
+ "GG"
2009
+ ],
2010
+ [
2011
+ "TGTG",
2012
+ "TGTG"
2013
+ ],
2014
+ [
2015
+ "TC",
2016
+ "ATC"
2017
+ ],
2018
+ [
2019
+ "TGC",
2020
+ "TGGG"
2021
+ ],
2022
+ [
2023
+ "TG",
2024
+ "AAG"
2025
+ ],
2026
+ [
2027
+ "TG",
2028
+ "TAG"
2029
+ ],
2030
+ [
2031
+ "TG",
2032
+ "TGG"
2033
+ ],
2034
+ [
2035
+ "AAAA",
2036
+ "ATT"
2037
+ ],
2038
+ [
2039
+ "AC",
2040
+ "TTC"
2041
+ ],
2042
+ [
2043
+ "TTCC",
2044
+ "CC"
2045
+ ],
2046
+ [
2047
+ "AT",
2048
+ "AGAA"
2049
+ ],
2050
+ [
2051
+ "TTG",
2052
+ "CCC"
2053
+ ],
2054
+ [
2055
+ "AGG",
2056
+ "AGG"
2057
+ ],
2058
+ [
2059
+ "TT",
2060
+ "TCCC"
2061
+ ],
2062
+ [
2063
+ "TAT",
2064
+ "ATT"
2065
+ ],
2066
+ [
2067
+ "ACC",
2068
+ "G"
2069
+ ],
2070
+ [
2071
+ "AC",
2072
+ "TAC"
2073
+ ],
2074
+ [
2075
+ "TCAC",
2076
+ "TGC"
2077
+ ],
2078
+ [
2079
+ "GC",
2080
+ "G"
2081
+ ],
2082
+ [
2083
+ "TT",
2084
+ "TGTG"
2085
+ ],
2086
+ [
2087
+ "AC",
2088
+ "AGC"
2089
+ ],
2090
+ [
2091
+ "TC",
2092
+ "ATG"
2093
+ ],
2094
+ [
2095
+ "AG",
2096
+ "TTTT"
2097
+ ],
2098
+ [
2099
+ "AGG",
2100
+ "AA"
2101
+ ],
2102
+ [
2103
+ "TT",
2104
+ "TATG"
2105
+ ],
2106
+ [
2107
+ "AT",
2108
+ "ATTG"
2109
+ ],
2110
+ [
2111
+ "TG",
2112
+ "ATG"
2113
+ ],
2114
+ [
2115
+ "TC",
2116
+ "TAA"
2117
+ ],
2118
+ [
2119
+ "TG",
2120
+ "TGC"
2121
+ ],
2122
+ [
2123
+ "AGG",
2124
+ "AAG"
2125
+ ],
2126
+ [
2127
+ "TT",
2128
+ "TGGG"
2129
+ ],
2130
+ [
2131
+ "TG",
2132
+ "TTC"
2133
+ ],
2134
+ [
2135
+ "AGCC",
2136
+ "CC"
2137
+ ],
2138
+ [
2139
+ "AG",
2140
+ "TTTC"
2141
+ ],
2142
+ [
2143
+ "AGGC",
2144
+ "TGG"
2145
+ ],
2146
+ [
2147
+ "TTTG",
2148
+ "CC"
2149
+ ],
2150
+ [
2151
+ "ATT",
2152
+ "TCC"
2153
+ ],
2154
+ [
2155
+ "AT",
2156
+ "ACAC"
2157
+ ],
2158
+ [
2159
+ "AAAA",
2160
+ "TAA"
2161
+ ],
2162
+ [
2163
+ "TAG",
2164
+ "AC"
2165
+ ],
2166
+ [
2167
+ "AGG",
2168
+ "AGAA"
2169
+ ],
2170
+ [
2171
+ "TG",
2172
+ "AGC"
2173
+ ],
2174
+ [
2175
+ "TGG",
2176
+ "AA"
2177
+ ],
2178
+ [
2179
+ "TTTT",
2180
+ "TAA"
2181
+ ],
2182
+ [
2183
+ "AGCC",
2184
+ "TCCC"
2185
+ ],
2186
+ [
2187
+ "ATG",
2188
+ "AA"
2189
+ ],
2190
+ [
2191
+ "TT",
2192
+ "TAAG"
2193
+ ],
2194
+ [
2195
+ "TC",
2196
+ "TGG"
2197
+ ],
2198
+ [
2199
+ "TT",
2200
+ "TATC"
2201
+ ],
2202
+ [
2203
+ "TT",
2204
+ "ATAA"
2205
+ ],
2206
+ [
2207
+ "TG",
2208
+ "ATT"
2209
+ ],
2210
+ [
2211
+ "AAC",
2212
+ "AA"
2213
+ ],
2214
+ [
2215
+ "TAGC",
2216
+ "TGGG"
2217
+ ],
2218
+ [
2219
+ "TC",
2220
+ "AAG"
2221
+ ],
2222
+ [
2223
+ "AAAA",
2224
+ "AA"
2225
+ ],
2226
+ [
2227
+ "ACTT",
2228
+ "TGGG"
2229
+ ],
2230
+ [
2231
+ "TATT",
2232
+ "CC"
2233
+ ],
2234
+ [
2235
+ "TC",
2236
+ "AGG"
2237
+ ],
2238
+ [
2239
+ "AAC",
2240
+ "AG"
2241
+ ],
2242
+ [
2243
+ "TTC",
2244
+ "TTC"
2245
+ ],
2246
+ [
2247
+ "TGTG",
2248
+ "GC"
2249
+ ],
2250
+ [
2251
+ "AT",
2252
+ "ATGC"
2253
+ ],
2254
+ [
2255
+ "ATTAC",
2256
+ "AGGC"
2257
+ ],
2258
+ [
2259
+ "AGGG",
2260
+ "GC"
2261
+ ],
2262
+ [
2263
+ "AGGG",
2264
+ "CC"
2265
+ ],
2266
+ [
2267
+ "TT",
2268
+ "ATTC"
2269
+ ],
2270
+ [
2271
+ "AT",
2272
+ "ATCC"
2273
+ ],
2274
+ [
2275
+ "TGTAA",
2276
+ "TCCCAGC"
2277
+ ],
2278
+ [
2279
+ "TAC",
2280
+ "G"
2281
+ ],
2282
+ [
2283
+ "AGAA",
2284
+ "AC"
2285
+ ],
2286
+ [
2287
+ "TG",
2288
+ "TCC"
2289
+ ],
2290
+ [
2291
+ "AG",
2292
+ "ATGG"
2293
+ ],
2294
+ [
2295
+ "TGTG",
2296
+ "CC"
2297
+ ],
2298
+ [
2299
+ "TTTC",
2300
+ "TC"
2301
+ ],
2302
+ [
2303
+ "TG",
2304
+ "AAC"
2305
+ ],
2306
+ [
2307
+ "AG",
2308
+ "TCTC"
2309
+ ],
2310
+ [
2311
+ "TG",
2312
+ "TTG"
2313
+ ],
2314
+ [
2315
+ "ATT",
2316
+ "TTTT"
2317
+ ],
2318
+ [
2319
+ "AAG",
2320
+ "AAG"
2321
+ ],
2322
+ [
2323
+ "TGGG",
2324
+ "GC"
2325
+ ],
2326
+ [
2327
+ "AGC",
2328
+ "AGC"
2329
+ ],
2330
+ [
2331
+ "G",
2332
+ "CCC"
2333
+ ],
2334
+ [
2335
+ "TTTG",
2336
+ "GC"
2337
+ ],
2338
+ [
2339
+ "AGGCTG",
2340
+ "AGGC"
2341
+ ],
2342
+ [
2343
+ "TGGG",
2344
+ "CC"
2345
+ ],
2346
+ [
2347
+ "TTC",
2348
+ "TCC"
2349
+ ],
2350
+ [
2351
+ "TAG",
2352
+ "AA"
2353
+ ],
2354
+ [
2355
+ "TGGAG",
2356
+ "TGC"
2357
+ ],
2358
+ [
2359
+ "ATT",
2360
+ "AA"
2361
+ ],
2362
+ [
2363
+ "AGTG",
2364
+ "CC"
2365
+ ],
2366
+ [
2367
+ "TG",
2368
+ "TCTC"
2369
+ ],
2370
+ [
2371
+ "AT",
2372
+ "ATGG"
2373
+ ],
2374
+ [
2375
+ "AC",
2376
+ "ATC"
2377
+ ],
2378
+ [
2379
+ "TGGG",
2380
+ "GG"
2381
+ ],
2382
+ [
2383
+ "TG",
2384
+ "ACC"
2385
+ ],
2386
+ [
2387
+ "AC",
2388
+ "TCC"
2389
+ ],
2390
+ [
2391
+ "TAA",
2392
+ "AAC"
2393
+ ],
2394
+ [
2395
+ "AG",
2396
+ "ATAA"
2397
+ ],
2398
+ [
2399
+ "TAA",
2400
+ "TTTT"
2401
+ ],
2402
+ [
2403
+ "TC",
2404
+ "AAC"
2405
+ ],
2406
+ [
2407
+ "TC",
2408
+ "TAC"
2409
+ ],
2410
+ [
2411
+ "TC",
2412
+ "TAG"
2413
+ ],
2414
+ [
2415
+ "G",
2416
+ "AG"
2417
+ ],
2418
+ [
2419
+ "TAA",
2420
+ "ATG"
2421
+ ],
2422
+ [
2423
+ "AGC",
2424
+ "AA"
2425
+ ],
2426
+ [
2427
+ "TAT",
2428
+ "ATG"
2429
+ ],
2430
+ [
2431
+ "ATAT",
2432
+ "ATAT"
2433
+ ],
2434
+ [
2435
+ "ATT",
2436
+ "TGC"
2437
+ ],
2438
+ [
2439
+ "TCC",
2440
+ "TCC"
2441
+ ],
2442
+ [
2443
+ "CCC",
2444
+ "AC"
2445
+ ],
2446
+ [
2447
+ "ATT",
2448
+ "TATT"
2449
+ ],
2450
+ [
2451
+ "TC",
2452
+ "TGCC"
2453
+ ],
2454
+ [
2455
+ "ATGG",
2456
+ "CC"
2457
+ ],
2458
+ [
2459
+ "TC",
2460
+ "GC"
2461
+ ],
2462
+ [
2463
+ "AG",
2464
+ "TATT"
2465
+ ],
2466
+ [
2467
+ "AGAA",
2468
+ "CC"
2469
+ ],
2470
+ [
2471
+ "TT",
2472
+ "AAAC"
2473
+ ],
2474
+ [
2475
+ "AA",
2476
+ "ATTC"
2477
+ ],
2478
+ [
2479
+ "AG",
2480
+ "AGAC"
2481
+ ],
2482
+ [
2483
+ "ATT",
2484
+ "TAC"
2485
+ ],
2486
+ [
2487
+ "ATTG",
2488
+ "CC"
2489
+ ],
2490
+ [
2491
+ "AAC",
2492
+ "AAC"
2493
+ ],
2494
+ [
2495
+ "TT",
2496
+ "TAAC"
2497
+ ],
2498
+ [
2499
+ "AC",
2500
+ "GG"
2501
+ ],
2502
+ [
2503
+ "AAG",
2504
+ "AAAA"
2505
+ ],
2506
+ [
2507
+ "TCTG",
2508
+ "GC"
2509
+ ],
2510
+ [
2511
+ "ATTC",
2512
+ "TCC"
2513
+ ],
2514
+ [
2515
+ "AGG",
2516
+ "TGG"
2517
+ ],
2518
+ [
2519
+ "TGC",
2520
+ "TGC"
2521
+ ],
2522
+ [
2523
+ "TTC",
2524
+ "AAG"
2525
+ ],
2526
+ [
2527
+ "AG",
2528
+ "AGGG"
2529
+ ],
2530
+ [
2531
+ "AC",
2532
+ "ACC"
2533
+ ],
2534
+ [
2535
+ "TC",
2536
+ "TTTT"
2537
+ ],
2538
+ [
2539
+ "AG",
2540
+ "AGGC"
2541
+ ],
2542
+ [
2543
+ "ATC",
2544
+ "ACC"
2545
+ ],
2546
+ [
2547
+ "TAA",
2548
+ "ATT"
2549
+ ],
2550
+ [
2551
+ "AAGG",
2552
+ "CC"
2553
+ ],
2554
+ [
2555
+ "TTGC",
2556
+ "AGTG"
2557
+ ],
2558
+ [
2559
+ "TG",
2560
+ "TAC"
2561
+ ],
2562
+ [
2563
+ "AA",
2564
+ "TTTC"
2565
+ ],
2566
+ [
2567
+ "ATCC",
2568
+ "CC"
2569
+ ],
2570
+ [
2571
+ "AC",
2572
+ "AAG"
2573
+ ],
2574
+ [
2575
+ "AC",
2576
+ "AGG"
2577
+ ],
2578
+ [
2579
+ "AC",
2580
+ "AAC"
2581
+ ],
2582
+ [
2583
+ "TGCC",
2584
+ "CC"
2585
+ ],
2586
+ [
2587
+ "AG",
2588
+ "ATTC"
2589
+ ],
2590
+ [
2591
+ "TT",
2592
+ "AGAA"
2593
+ ],
2594
+ [
2595
+ "TTGG",
2596
+ "GG"
2597
+ ],
2598
+ [
2599
+ "AG",
2600
+ "ACAC"
2601
+ ],
2602
+ [
2603
+ "TGG",
2604
+ "AAG"
2605
+ ],
2606
+ [
2607
+ "ACC",
2608
+ "TCC"
2609
+ ],
2610
+ [
2611
+ "ATG",
2612
+ "GGG"
2613
+ ],
2614
+ [
2615
+ "AGCC",
2616
+ "TCC"
2617
+ ],
2618
+ [
2619
+ "TT",
2620
+ "ATTG"
2621
+ ],
2622
+ [
2623
+ "TAA",
2624
+ "AAG"
2625
+ ],
2626
+ [
2627
+ "ATC",
2628
+ "TTC"
2629
+ ],
2630
+ [
2631
+ "ATC",
2632
+ "TCC"
2633
+ ],
2634
+ [
2635
+ "TGAA",
2636
+ "GC"
2637
+ ],
2638
+ [
2639
+ "TAA",
2640
+ "TC"
2641
+ ],
2642
+ [
2643
+ "AA",
2644
+ "ATGC"
2645
+ ],
2646
+ [
2647
+ "TTG",
2648
+ "TTG"
2649
+ ],
2650
+ [
2651
+ "ATT",
2652
+ "CCC"
2653
+ ],
2654
+ [
2655
+ "TAC",
2656
+ "TAAAA"
2657
+ ],
2658
+ [
2659
+ "AT",
2660
+ "AGTG"
2661
+ ],
2662
+ [
2663
+ "AA",
2664
+ "ATAC"
2665
+ ],
2666
+ [
2667
+ "TTGG",
2668
+ "GC"
2669
+ ],
2670
+ [
2671
+ "TAG",
2672
+ "AGAC"
2673
+ ],
2674
+ [
2675
+ "TG",
2676
+ "TTTT"
2677
+ ],
2678
+ [
2679
+ "TTC",
2680
+ "TGC"
2681
+ ],
2682
+ [
2683
+ "TGG",
2684
+ "CCC"
2685
+ ],
2686
+ [
2687
+ "TCTG",
2688
+ "TC"
2689
+ ],
2690
+ [
2691
+ "AGC",
2692
+ "TCC"
2693
+ ],
2694
+ [
2695
+ "AAC",
2696
+ "TCC"
2697
+ ],
2698
+ [
2699
+ "TT",
2700
+ "AGCC"
2701
+ ],
2702
+ [
2703
+ "AAAG",
2704
+ "TGCTGGG"
2705
+ ],
2706
+ [
2707
+ "AT",
2708
+ "AGAC"
2709
+ ],
2710
+ [
2711
+ "TATT",
2712
+ "TTTAG"
2713
+ ],
2714
+ [
2715
+ "AC",
2716
+ "TTG"
2717
+ ],
2718
+ [
2719
+ "ACC",
2720
+ "ACC"
2721
+ ],
2722
+ [
2723
+ "AA",
2724
+ "ACAC"
2725
+ ],
2726
+ [
2727
+ "G",
2728
+ "TGG"
2729
+ ],
2730
+ [
2731
+ "ATT",
2732
+ "TAG"
2733
+ ],
2734
+ [
2735
+ "AGG",
2736
+ "AGC"
2737
+ ],
2738
+ [
2739
+ "AGGC",
2740
+ "TGGAGTGC"
2741
+ ],
2742
+ [
2743
+ "AT",
2744
+ "ACCC"
2745
+ ],
2746
+ [
2747
+ "ATG",
2748
+ "TAA"
2749
+ ],
2750
+ [
2751
+ "AC",
2752
+ "GC"
2753
+ ],
2754
+ [
2755
+ "AG",
2756
+ "TAT"
2757
+ ],
2758
+ [
2759
+ "TT",
2760
+ "TACC"
2761
+ ],
2762
+ [
2763
+ "AC",
2764
+ "TAA"
2765
+ ],
2766
+ [
2767
+ "AGG",
2768
+ "CCC"
2769
+ ],
2770
+ [
2771
+ "AAGG",
2772
+ "GG"
2773
+ ],
2774
+ [
2775
+ "TCTC",
2776
+ "G"
2777
+ ],
2778
+ [
2779
+ "ATG",
2780
+ "AAG"
2781
+ ],
2782
+ [
2783
+ "AA",
2784
+ "AGAC"
2785
+ ],
2786
+ [
2787
+ "TG",
2788
+ "AAAA"
2789
+ ],
2790
+ [
2791
+ "AAGG",
2792
+ "GC"
2793
+ ],
2794
+ [
2795
+ "AT",
2796
+ "AGGC"
2797
+ ],
2798
+ [
2799
+ "AG",
2800
+ "AGTG"
2801
+ ],
2802
+ [
2803
+ "AGC",
2804
+ "TGC"
2805
+ ],
2806
+ [
2807
+ "ATG",
2808
+ "TTC"
2809
+ ],
2810
+ [
2811
+ "TATT",
2812
+ "TC"
2813
+ ],
2814
+ [
2815
+ "TG",
2816
+ "ATC"
2817
+ ],
2818
+ [
2819
+ "AG",
2820
+ "TTTG"
2821
+ ],
2822
+ [
2823
+ "AGC",
2824
+ "TAA"
2825
+ ],
2826
+ [
2827
+ "AG",
2828
+ "AGCC"
2829
+ ],
2830
+ [
2831
+ "TGC",
2832
+ "TTC"
2833
+ ],
2834
+ [
2835
+ "ATC",
2836
+ "ATC"
2837
+ ],
2838
+ [
2839
+ "AAC",
2840
+ "ATGG"
2841
+ ],
2842
+ [
2843
+ "AGC",
2844
+ "TTC"
2845
+ ],
2846
+ [
2847
+ "AAG",
2848
+ "AAC"
2849
+ ],
2850
+ [
2851
+ "TTTT",
2852
+ "TTG"
2853
+ ],
2854
+ [
2855
+ "AGGG",
2856
+ "GG"
2857
+ ],
2858
+ [
2859
+ "ATAA",
2860
+ "GC"
2861
+ ],
2862
+ [
2863
+ "TAAG",
2864
+ "CC"
2865
+ ],
2866
+ [
2867
+ "AC",
2868
+ "TGG"
2869
+ ],
2870
+ [
2871
+ "AC",
2872
+ "AAAA"
2873
+ ],
2874
+ [
2875
+ "ATC",
2876
+ "ATT"
2877
+ ],
2878
+ [
2879
+ "TC",
2880
+ "TTTC"
2881
+ ],
2882
+ [
2883
+ "ATG",
2884
+ "ATG"
2885
+ ],
2886
+ [
2887
+ "TGC",
2888
+ "AA"
2889
+ ],
2890
+ [
2891
+ "AGG",
2892
+ "TTC"
2893
+ ],
2894
+ [
2895
+ "AAC",
2896
+ "ATT"
2897
+ ],
2898
+ [
2899
+ "ATG",
2900
+ "GGC"
2901
+ ],
2902
+ [
2903
+ "AT",
2904
+ "AGAG"
2905
+ ],
2906
+ [
2907
+ "AA",
2908
+ "ATGG"
2909
+ ],
2910
+ [
2911
+ "AG",
2912
+ "TTCC"
2913
+ ],
2914
+ [
2915
+ "TT",
2916
+ "TAGC"
2917
+ ],
2918
+ [
2919
+ "AAC",
2920
+ "TTC"
2921
+ ],
2922
+ [
2923
+ "AGC",
2924
+ "AAG"
2925
+ ],
2926
+ [
2927
+ "AT",
2928
+ "AAAAC"
2929
+ ],
2930
+ [
2931
+ "AAAA",
2932
+ "TC"
2933
+ ],
2934
+ [
2935
+ "AGCC",
2936
+ "AC"
2937
+ ],
2938
+ [
2939
+ "AGG",
2940
+ "AAC"
2941
+ ],
2942
+ [
2943
+ "TTAA",
2944
+ "CC"
2945
+ ],
2946
+ [
2947
+ "TATT",
2948
+ "TATT"
2949
+ ],
2950
+ [
2951
+ "TTTC",
2952
+ "TG"
2953
+ ],
2954
+ [
2955
+ "ATAA",
2956
+ "GG"
2957
+ ],
2958
+ [
2959
+ "AGCC",
2960
+ "ACC"
2961
+ ],
2962
+ [
2963
+ "AG",
2964
+ "ATGC"
2965
+ ],
2966
+ [
2967
+ "TTAA",
2968
+ "GC"
2969
+ ],
2970
+ [
2971
+ "TTG",
2972
+ "TAA"
2973
+ ],
2974
+ [
2975
+ "AG",
2976
+ "TGTG"
2977
+ ],
2978
+ [
2979
+ "AACC",
2980
+ "CC"
2981
+ ],
2982
+ [
2983
+ "TTC",
2984
+ "ATT"
2985
+ ],
2986
+ [
2987
+ "ATC",
2988
+ "ATG"
2989
+ ],
2990
+ [
2991
+ "AA",
2992
+ "TGAA"
2993
+ ],
2994
+ [
2995
+ "AGG",
2996
+ "TGC"
2997
+ ],
2998
+ [
2999
+ "AAAAAAAA",
3000
+ "AAAAAAAA"
3001
+ ],
3002
+ [
3003
+ "AGG",
3004
+ "ATG"
3005
+ ],
3006
+ [
3007
+ "AGCC",
3008
+ "G"
3009
+ ],
3010
+ [
3011
+ "TGG",
3012
+ "TGG"
3013
+ ],
3014
+ [
3015
+ "AG",
3016
+ "TGGG"
3017
+ ],
3018
+ [
3019
+ "TGCAC",
3020
+ "TCCAGCC"
3021
+ ],
3022
+ [
3023
+ "TATT",
3024
+ "GC"
3025
+ ],
3026
+ [
3027
+ "TAG",
3028
+ "TC"
3029
+ ],
3030
+ [
3031
+ "CCC",
3032
+ "G"
3033
+ ],
3034
+ [
3035
+ "AAG",
3036
+ "TAA"
3037
+ ],
3038
+ [
3039
+ "TAG",
3040
+ "TG"
3041
+ ],
3042
+ [
3043
+ "TTTTTTTT",
3044
+ "TTTTTTTT"
3045
+ ],
3046
+ [
3047
+ "AGC",
3048
+ "ATT"
3049
+ ],
3050
+ [
3051
+ "ATC",
3052
+ "TGC"
3053
+ ],
3054
+ [
3055
+ "TCTC",
3056
+ "AC"
3057
+ ],
3058
+ [
3059
+ "AA",
3060
+ "ATTG"
3061
+ ],
3062
+ [
3063
+ "TT",
3064
+ "TAGG"
3065
+ ],
3066
+ [
3067
+ "AG",
3068
+ "ACCC"
3069
+ ],
3070
+ [
3071
+ "GGG",
3072
+ "CC"
3073
+ ],
3074
+ [
3075
+ "TCC",
3076
+ "TTC"
3077
+ ],
3078
+ [
3079
+ "AT",
3080
+ "AGGG"
3081
+ ],
3082
+ [
3083
+ "AA",
3084
+ "TATG"
3085
+ ],
3086
+ [
3087
+ "TT",
3088
+ "ATAC"
3089
+ ],
3090
+ [
3091
+ "TAG",
3092
+ "AAG"
3093
+ ],
3094
+ [
3095
+ "AA",
3096
+ "AGTG"
3097
+ ],
3098
+ [
3099
+ "AA",
3100
+ "ATCC"
3101
+ ],
3102
+ [
3103
+ "TTCC",
3104
+ "TC"
3105
+ ],
3106
+ [
3107
+ "TTTC",
3108
+ "AC"
3109
+ ],
3110
+ [
3111
+ "AG",
3112
+ "TATG"
3113
+ ],
3114
+ [
3115
+ "TACTAAAA",
3116
+ "ATAC"
3117
+ ],
3118
+ [
3119
+ "ATG",
3120
+ "TGC"
3121
+ ],
3122
+ [
3123
+ "AGG",
3124
+ "AGGC"
3125
+ ],
3126
+ [
3127
+ "TAT",
3128
+ "ATC"
3129
+ ],
3130
+ [
3131
+ "TTC",
3132
+ "TAA"
3133
+ ],
3134
+ [
3135
+ "TG",
3136
+ "AGGC"
3137
+ ],
3138
+ [
3139
+ "ACAC",
3140
+ "AC"
3141
+ ],
3142
+ [
3143
+ "TCC",
3144
+ "CCC"
3145
+ ],
3146
+ [
3147
+ "AAC",
3148
+ "ATC"
3149
+ ],
3150
+ [
3151
+ "AAGC",
3152
+ "G"
3153
+ ],
3154
+ [
3155
+ "AA",
3156
+ "TGGC"
3157
+ ],
3158
+ [
3159
+ "ACC",
3160
+ "CCC"
3161
+ ],
3162
+ [
3163
+ "AG",
3164
+ "ATAC"
3165
+ ],
3166
+ [
3167
+ "AT",
3168
+ "AAAAG"
3169
+ ],
3170
+ [
3171
+ "ATG",
3172
+ "ATT"
3173
+ ],
3174
+ [
3175
+ "TGG",
3176
+ "AGG"
3177
+ ],
3178
+ [
3179
+ "AG",
3180
+ "TTAA"
3181
+ ]
3182
+ ]
3183
+ }
3184
+ }
tokenizer_config.json ADDED
@@ -0,0 +1,59 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "4": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": true,
45
+ "cls_token": "[CLS]",
46
+ "do_lower_case": false,
47
+ "extra_special_tokens": {},
48
+ "mask_token": "[MASK]",
49
+ "model_max_length": 512,
50
+ "pad_token": "[PAD]",
51
+ "special_tokens": {
52
+ "mask_token": "[MASK]",
53
+ "pad_token": "[PAD]",
54
+ "sep_token": "[SEP]"
55
+ },
56
+ "tokenize_chinese_chars": false,
57
+ "tokenizer_class": "PreTrainedTokenizerFast",
58
+ "trust_remote_code": true
59
+ }