janko commited on
Commit
93670af
1 Parent(s): ff24e53

Retrain with spaCy v3.7

Browse files
README.md CHANGED
@@ -4,7 +4,6 @@ tags:
4
  - token-classification
5
  language:
6
  - grc
7
- license: mit
8
  model-index:
9
  - name: grc_odycy_joint_trf
10
  results:
@@ -14,80 +13,61 @@ model-index:
14
  metrics:
15
  - name: TAG (XPOS) Accuracy
16
  type: accuracy
17
- value: 0.9418372337
18
  - task:
19
  name: POS
20
  type: token-classification
21
  metrics:
22
  - name: POS (UPOS) Accuracy
23
  type: accuracy
24
- value: 0.9732169053
25
  - task:
26
  name: MORPH
27
  type: token-classification
28
  metrics:
29
  - name: Morph (UFeats) Accuracy
30
  type: accuracy
31
- value: 0.9409003269
32
  - task:
33
  name: LEMMA
34
  type: token-classification
35
  metrics:
36
  - name: Lemma Accuracy
37
  type: accuracy
38
- value: 0.9388661536
39
  - task:
40
  name: UNLABELED_DEPENDENCIES
41
  type: token-classification
42
  metrics:
43
  - name: Unlabeled Attachment Score (UAS)
44
  type: f_score
45
- value: 0.813958352
46
  - task:
47
  name: LABELED_DEPENDENCIES
48
  type: token-classification
49
  metrics:
50
  - name: Labeled Attachment Score (LAS)
51
  type: f_score
52
- value: 0.7641839204
53
  - task:
54
  name: SENTS
55
  type: token-classification
56
  metrics:
57
  - name: Sentences F-Score
58
  type: f_score
59
- value: 0.8301282051
60
  ---
61
- <p align="center">
62
- <img width="200" src="https://github.com/centre-for-humanities-computing/odyCy/raw/main/docs/_static/logo_with_text_below.svg">
63
- <div align="center" style="color: #2c5882; font-weight: bold; font-size: 18px; margin-top: -20px;">
64
- A general-purpose NLP pipeline for Ancient-Greek.
65
- </div>
66
- </p>
67
-
68
- [![Open in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/github/centre-for-humanities-computing/odyCy/blob/main/tutorials/01_odycy_getting_started.ipynb#&offline=true&sandboxMode=true)
69
- Check out our Documentation on [Basic Usage](https://centre-for-humanities-computing.github.io/odyCy/getting_started.html).
70
-
71
-
72
- ## Performance
73
-
74
- odyCy achieves state of the art performance on multiple tasks on unseen test data from the Universal Dependencies Perseus treebank,
75
- and performs second best on the PROIEL treebank’s test set on even more tasks.
76
- In addition performance also seems relatively stable across the two evaluation datasets in comparison with other NLP pipelines.
77
-
78
- For plots and tables on OdyCy's performance, check out the Documentation page on [Performance](https://centre-for-humanities-computing.github.io/odyCy/performance.html)
79
-
80
  | Feature | Description |
81
  | --- | --- |
82
  | **Name** | `grc_odycy_joint_trf` |
83
- | **Version** | `0.6.0` |
84
- | **spaCy** | `>=3.5.0,<3.6.0` |
85
  | **Default Pipeline** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `frequency_lemmatizer` |
86
  | **Components** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `frequency_lemmatizer` |
87
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
88
  | **Sources** | n/a |
89
- | **License** | `MIT` |
90
- | **Author** | [{Jan Kostkan, Márton Kardos}](https://github.com/centre-for-humanities-computing/odyCy) |
91
 
92
  ### Label Scheme
93
 
@@ -107,17 +87,17 @@ For plots and tables on OdyCy's performance, check out the Documentation page on
107
 
108
  | Type | Score |
109
  | --- | --- |
110
- | `TAG_ACC` | 94.18 |
111
- | `POS_ACC` | 97.32 |
112
- | `MORPH_ACC` | 94.09 |
113
- | `DEP_UAS` | 81.40 |
114
- | `DEP_LAS` | 76.42 |
115
- | `SENTS_P` | 81.96 |
116
- | `SENTS_R` | 84.09 |
117
- | `SENTS_F` | 83.01 |
118
- | `LEMMA_ACC` | 93.89 |
119
- | `TRANSFORMER_LOSS` | 2030746.62 |
120
- | `TAGGER_LOSS` | 27277.06 |
121
- | `MORPHOLOGIZER_LOSS` | 66640.79 |
122
- | `PARSER_LOSS` | 2650835.48 |
123
- | `TRAINABLE_LEMMATIZER_LOSS` | 170902.76 |
 
4
  - token-classification
5
  language:
6
  - grc
 
7
  model-index:
8
  - name: grc_odycy_joint_trf
9
  results:
 
13
  metrics:
14
  - name: TAG (XPOS) Accuracy
15
  type: accuracy
16
+ value: 0.9415167095
17
  - task:
18
  name: POS
19
  type: token-classification
20
  metrics:
21
  - name: POS (UPOS) Accuracy
22
  type: accuracy
23
+ value: 0.9720856153
24
  - task:
25
  name: MORPH
26
  type: token-classification
27
  metrics:
28
  - name: Morph (UFeats) Accuracy
29
  type: accuracy
30
+ value: 0.9415709615
31
  - task:
32
  name: LEMMA
33
  type: token-classification
34
  metrics:
35
  - name: Lemma Accuracy
36
  type: accuracy
37
+ value: 0.938530944
38
  - task:
39
  name: UNLABELED_DEPENDENCIES
40
  type: token-classification
41
  metrics:
42
  - name: Unlabeled Attachment Score (UAS)
43
  type: f_score
44
+ value: 0.8192166353
45
  - task:
46
  name: LABELED_DEPENDENCIES
47
  type: token-classification
48
  metrics:
49
  - name: Labeled Attachment Score (LAS)
50
  type: f_score
51
+ value: 0.768564301
52
  - task:
53
  name: SENTS
54
  type: token-classification
55
  metrics:
56
  - name: Sentences F-Score
57
  type: f_score
58
+ value: 0.8352455255
59
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
60
  | Feature | Description |
61
  | --- | --- |
62
  | **Name** | `grc_odycy_joint_trf` |
63
+ | **Version** | `0.7.0` |
64
+ | **spaCy** | `>=3.7.4,<3.8.0` |
65
  | **Default Pipeline** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `frequency_lemmatizer` |
66
  | **Components** | `transformer`, `tagger`, `morphologizer`, `parser`, `trainable_lemmatizer`, `frequency_lemmatizer` |
67
  | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
68
  | **Sources** | n/a |
69
+ | **License** | n/a |
70
+ | **Author** | [n/a]() |
71
 
72
  ### Label Scheme
73
 
 
87
 
88
  | Type | Score |
89
  | --- | --- |
90
+ | `TAG_ACC` | 94.15 |
91
+ | `POS_ACC` | 97.21 |
92
+ | `MORPH_ACC` | 94.16 |
93
+ | `DEP_UAS` | 81.92 |
94
+ | `DEP_LAS` | 76.86 |
95
+ | `SENTS_P` | 82.65 |
96
+ | `SENTS_R` | 84.42 |
97
+ | `SENTS_F` | 83.52 |
98
+ | `LEMMA_ACC` | 93.85 |
99
+ | `TRANSFORMER_LOSS` | 875117.52 |
100
+ | `TAGGER_LOSS` | 8013.69 |
101
+ | `MORPHOLOGIZER_LOSS` | 18489.32 |
102
+ | `PARSER_LOSS` | 2193434.41 |
103
+ | `TRAINABLE_LEMMATIZER_LOSS` | 34833.09 |
config.cfg CHANGED
@@ -18,6 +18,7 @@ before_creation = null
18
  after_creation = null
19
  after_pipeline_creation = null
20
  tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
 
21
 
22
  [components]
23
 
@@ -29,6 +30,7 @@ overwrite = true
29
  [components.morphologizer]
30
  factory = "morphologizer"
31
  extend = false
 
32
  overwrite = true
33
  scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
34
 
@@ -68,6 +70,7 @@ upstream = "*"
68
 
69
  [components.tagger]
70
  factory = "tagger"
 
71
  neg_prefix = "!"
72
  overwrite = false
73
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
 
18
  after_creation = null
19
  after_pipeline_creation = null
20
  tokenizer = {"@tokenizers":"spacy.Tokenizer.v1"}
21
+ vectors = {"@vectors":"spacy.Vectors.v1"}
22
 
23
  [components]
24
 
 
30
  [components.morphologizer]
31
  factory = "morphologizer"
32
  extend = false
33
+ label_smoothing = 0.0
34
  overwrite = true
35
  scorer = {"@scorers":"spacy.morphologizer_scorer.v1"}
36
 
 
70
 
71
  [components.tagger]
72
  factory = "tagger"
73
+ label_smoothing = 0.0
74
  neg_prefix = "!"
75
  overwrite = false
76
  scorer = {"@scorers":"spacy.tagger_scorer.v1"}
grc_odycy_joint_trf-any-py3-none-any.whl CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e72e0180d0f56c1ace27f38f3c147c04190263fecc8d31d030662dfcc0f83164
3
- size 497308444
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8a828bb5d105e0f2d85d22479ab487fd9b5bacb9fdfb7098245e5e5aaefecd5e
3
+ size 497296758
meta.json CHANGED
@@ -1,14 +1,14 @@
1
  {
2
  "lang":"grc",
3
  "name":"odycy_joint_trf",
4
- "version":"0.6.0",
5
- "description":"Ancient Greek transformer pipeline (based on pranaydeeps/Ancient-Greek-BERT)",
6
- "author":"{Jan Kostkan, M\u00e1rton Kardos}",
7
  "email":"",
8
- "url":"https://github.com/centre-for-humanities-computing/odyCy",
9
- "license":"MIT",
10
- "spacy_version":">=3.5.0,<3.6.0",
11
- "spacy_git_version":"Unknown",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
@@ -2348,74 +2348,74 @@
2348
 
2349
  ],
2350
  "performance":{
2351
- "tag_acc":0.9418372337,
2352
- "pos_acc":0.9732169053,
2353
- "morph_acc":0.9409003269,
2354
  "morph_per_feat":{
2355
  "Case":{
2356
- "p":0.9775269209,
2357
- "r":0.9754970669,
2358
- "f":0.976510939
2359
  },
2360
  "Gender":{
2361
- "p":0.9487192863,
2362
- "r":0.9493135668,
2363
- "f":0.9490163336
2364
  },
2365
  "Number":{
2366
- "p":0.9898611897,
2367
- "r":0.9878837886,
2368
- "f":0.9888715006
2369
  },
2370
  "Person":{
2371
- "p":0.9832460733,
2372
- "r":0.97710718,
2373
- "f":0.9801670146
2374
  },
2375
  "PronType":{
2376
- "p":0.98440039,
2377
- "r":0.9834415584,
2378
- "f":0.9839207406
2379
  },
2380
  "Polarity":{
2381
- "p":0.9947368421,
2382
- "r":0.9792746114,
2383
- "f":0.9869451697
2384
  },
2385
  "Aspect":{
2386
- "p":0.9682539683,
2387
- "r":0.9682539683,
2388
- "f":0.9682539683
2389
  },
2390
  "Mood":{
2391
- "p":0.9811069718,
2392
  "r":0.9795430393,
2393
- "f":0.9803243818
2394
  },
2395
  "Tense":{
2396
- "p":0.9734929457,
2397
- "r":0.9705882353,
2398
- "f":0.9720384205
2399
  },
2400
  "VerbForm":{
2401
- "p":0.9923065964,
2402
- "r":0.9893465909,
2403
- "f":0.990824383
2404
  },
2405
  "Voice":{
2406
- "p":0.9783352338,
2407
- "r":0.9756929638,
2408
- "f":0.9770123123
2409
  },
2410
  "Degree":{
2411
- "p":0.9206008584,
2412
- "r":0.9387308534,
2413
- "f":0.9295774648
2414
  },
2415
  "Definite":{
2416
- "p":0.9914209115,
2417
- "r":0.998919503,
2418
- "f":0.9951560818
2419
  },
2420
  "Reflex":{
2421
  "p":1.0,
@@ -2428,113 +2428,113 @@
2428
  "f":0.9444444444
2429
  }
2430
  },
2431
- "dep_uas":0.813958352,
2432
- "dep_las":0.7641839204,
2433
  "dep_las_per_type":{
2434
  "nsubj":{
2435
- "p":0.7885687732,
2436
- "r":0.7731207289,
2437
- "f":0.780768346
2438
  },
2439
  "discourse":{
2440
- "p":0.8357810414,
2441
- "r":0.8402684564,
2442
- "f":0.8380187416
2443
  },
2444
  "mark":{
2445
- "p":0.8735440932,
2446
- "r":0.8649093904,
2447
- "f":0.869205298
2448
  },
2449
  "advmod":{
2450
- "p":0.7644385027,
2451
- "r":0.7573509934,
2452
- "f":0.7608782435
2453
  },
2454
  "advcl":{
2455
- "p":0.7203791469,
2456
- "r":0.7276595745,
2457
- "f":0.7240010585
2458
  },
2459
  "xcomp":{
2460
- "p":0.5360501567,
2461
- "r":0.5504291845,
2462
- "f":0.5431445209
2463
  },
2464
  "cop":{
2465
- "p":0.7619047619,
2466
- "r":0.7404426559,
2467
- "f":0.7510204082
2468
  },
2469
  "root":{
2470
- "p":0.8865280289,
2471
- "r":0.909554731,
2472
- "f":0.8978937729
2473
  },
2474
  "det":{
2475
- "p":0.9145708583,
2476
- "r":0.9169501701,
2477
- "f":0.9157589687
2478
  },
2479
  "nmod":{
2480
- "p":0.6646473388,
2481
- "r":0.6336633663,
2482
- "f":0.6487856389
2483
  },
2484
  "obj":{
2485
- "p":0.7403100775,
2486
- "r":0.7582373958,
2487
- "f":0.7491665032
2488
  },
2489
  "case":{
2490
- "p":0.9387395328,
2491
- "r":0.9433126661,
2492
- "f":0.9410205434
2493
  },
2494
  "obl":{
2495
- "p":0.6969831411,
2496
- "r":0.7029082774,
2497
- "f":0.69993317
2498
  },
2499
  "cc":{
2500
- "p":0.7226130653,
2501
- "r":0.7182817183,
2502
- "f":0.7204408818
2503
  },
2504
  "conj":{
2505
- "p":0.6913996627,
2506
- "r":0.6570512821,
2507
- "f":0.6737880033
2508
  },
2509
  "obl:agent":{
2510
- "p":0.9130434783,
2511
- "r":0.5675675676,
2512
- "f":0.7
2513
  },
2514
  "ccomp":{
2515
- "p":0.5300925926,
2516
- "r":0.5599022005,
2517
- "f":0.5445897741
2518
  },
2519
  "nsubj:pass":{
2520
- "p":0.7019230769,
2521
  "r":0.6822429907,
2522
- "f":0.691943128
2523
  },
2524
  "amod":{
2525
- "p":0.6523297491,
2526
- "r":0.5214899713,
2527
- "f":0.5796178344
2528
  },
2529
  "acl":{
2530
- "p":0.4615384615,
2531
- "r":0.4036697248,
2532
- "f":0.4306688418
2533
  },
2534
  "iobj":{
2535
- "p":0.7075812274,
2536
- "r":0.6889279438,
2537
- "f":0.6981300089
2538
  },
2539
  "dep":{
2540
  "p":0.0,
@@ -2542,54 +2542,49 @@
2542
  "f":0.0
2543
  },
2544
  "nummod":{
2545
- "p":0.7536231884,
2546
  "r":0.619047619,
2547
- "f":0.6797385621
2548
  },
2549
  "vocative":{
2550
- "p":0.7865168539,
2551
- "r":0.7608695652,
2552
- "f":0.773480663
2553
  },
2554
  "orphan":{
2555
- "p":0.1212121212,
2556
- "r":0.0930232558,
2557
- "f":0.1052631579
2558
  },
2559
  "appos":{
2560
- "p":0.4166666667,
2561
- "r":0.3614457831,
2562
- "f":0.3870967742
2563
  },
2564
  "parataxis":{
2565
  "p":0.0,
2566
  "r":0.0,
2567
  "f":0.0
2568
  },
2569
- "csubj":{
2570
- "p":0.5217391304,
2571
- "r":0.2330097087,
2572
- "f":0.322147651
2573
- },
2574
- "fixed":{
2575
- "p":0.1111111111,
2576
- "r":0.6,
2577
- "f":0.1875
2578
- },
2579
  "dislocated":{
2580
- "p":0.2777777778,
2581
- "r":0.1923076923,
2582
- "f":0.2272727273
2583
  },
2584
  "csubj:pass":{
2585
- "p":0.125,
2586
  "r":0.2,
2587
- "f":0.1538461538
2588
  },
2589
  "flat:name":{
2590
- "p":0.8125,
2591
  "r":0.5909090909,
2592
- "f":0.6842105263
 
 
 
 
 
2593
  },
2594
  "aux:pass":{
2595
  "p":0.0,
@@ -2600,19 +2595,24 @@
2600
  "p":0.0,
2601
  "r":0.0,
2602
  "f":0.0
 
 
 
 
 
2603
  }
2604
  },
2605
- "sents_p":0.8196202532,
2606
- "sents_r":0.8409090909,
2607
- "sents_f":0.8301282051,
2608
- "lemma_acc":0.9388661536,
2609
- "transformer_loss":20307.4661576883,
2610
- "tagger_loss":272.7706225453,
2611
- "morphologizer_loss":666.4078642421,
2612
- "parser_loss":26508.3547870876,
2613
- "trainable_lemmatizer_loss":1709.0275796428
2614
  },
2615
  "requirements":[
2616
- "spacy-transformers>=1.1.9,<1.2.0"
2617
  ]
2618
  }
 
1
  {
2
  "lang":"grc",
3
  "name":"odycy_joint_trf",
4
+ "version":"0.7.0",
5
+ "description":"",
6
+ "author":"",
7
  "email":"",
8
+ "url":"",
9
+ "license":"",
10
+ "spacy_version":">=3.7.4,<3.8.0",
11
+ "spacy_git_version":"bff8725f4",
12
  "vectors":{
13
  "width":0,
14
  "vectors":0,
 
2348
 
2349
  ],
2350
  "performance":{
2351
+ "tag_acc":0.9415167095,
2352
+ "pos_acc":0.9720856153,
2353
+ "morph_acc":0.9415709615,
2354
  "morph_per_feat":{
2355
  "Case":{
2356
+ "p":0.9787145459,
2357
+ "r":0.9762757618,
2358
+ "f":0.9774936327
2359
  },
2360
  "Gender":{
2361
+ "p":0.9498591255,
2362
+ "r":0.9503053714,
2363
+ "f":0.9500821961
2364
  },
2365
  "Number":{
2366
+ "p":0.9900352465,
2367
+ "r":0.9880574977,
2368
+ "f":0.9890453834
2369
  },
2370
  "Person":{
2371
+ "p":0.9812186978,
2372
+ "r":0.9785639958,
2373
+ "f":0.9798895488
2374
  },
2375
  "PronType":{
2376
+ "p":0.98157129,
2377
+ "r":0.9857142857,
2378
+ "f":0.9836384254
2379
  },
2380
  "Polarity":{
2381
+ "p":1.0,
2382
+ "r":0.9689119171,
2383
+ "f":0.9842105263
2384
  },
2385
  "Aspect":{
2386
+ "p":0.9671941335,
2387
+ "r":0.9701897019,
2388
+ "f":0.9686896019
2389
  },
2390
  "Mood":{
2391
+ "p":0.9790228359,
2392
  "r":0.9795430393,
2393
+ "f":0.9792828685
2394
  },
2395
  "Tense":{
2396
+ "p":0.9721313806,
2397
+ "r":0.9714407502,
2398
+ "f":0.9717859427
2399
  },
2400
  "VerbForm":{
2401
+ "p":0.9914700028,
2402
+ "r":0.990625,
2403
+ "f":0.9910473213
2404
  },
2405
  "Voice":{
2406
+ "p":0.9796614991,
2407
+ "r":0.9791044776,
2408
+ "f":0.9793829091
2409
  },
2410
  "Degree":{
2411
+ "p":0.9064824655,
2412
+ "r":0.9332603939,
2413
+ "f":0.9196765499
2414
  },
2415
  "Definite":{
2416
+ "p":0.9930145083,
2417
+ "r":0.9983792545,
2418
+ "f":0.9956896552
2419
  },
2420
  "Reflex":{
2421
  "p":1.0,
 
2428
  "f":0.9444444444
2429
  }
2430
  },
2431
+ "dep_uas":0.8192166353,
2432
+ "dep_las":0.768564301,
2433
  "dep_las_per_type":{
2434
  "nsubj":{
2435
+ "p":0.7919741697,
2436
+ "r":0.7822323462,
2437
+ "f":0.7870731148
2438
  },
2439
  "discourse":{
2440
+ "p":0.8364361702,
2441
+ "r":0.844295302,
2442
+ "f":0.8403473614
2443
  },
2444
  "mark":{
2445
+ "p":0.8778501629,
2446
+ "r":0.8879736409,
2447
+ "f":0.8828828829
2448
  },
2449
  "advmod":{
2450
+ "p":0.7775389575,
2451
+ "r":0.7666225166,
2452
+ "f":0.7720421502
2453
  },
2454
  "advcl":{
2455
+ "p":0.7145085803,
2456
+ "r":0.7308510638,
2457
+ "f":0.722587431
2458
  },
2459
  "xcomp":{
2460
+ "p":0.525388601,
2461
+ "r":0.5439914163,
2462
+ "f":0.5345282024
2463
  },
2464
  "cop":{
2465
+ "p":0.7645875252,
2466
+ "r":0.7645875252,
2467
+ "f":0.7645875252
2468
  },
2469
  "root":{
2470
+ "p":0.8955495005,
2471
+ "r":0.9146567718,
2472
+ "f":0.9050022946
2473
  },
2474
  "det":{
2475
+ "p":0.9176798884,
2476
+ "r":0.9213528117,
2477
+ "f":0.9195126822
2478
  },
2479
  "nmod":{
2480
+ "p":0.681352915,
2481
+ "r":0.6316006601,
2482
+ "f":0.6555341469
2483
  },
2484
  "obj":{
2485
+ "p":0.7391472868,
2486
+ "r":0.757046447,
2487
+ "f":0.7479898019
2488
  },
2489
  "case":{
2490
+ "p":0.9376098418,
2491
+ "r":0.9450841453,
2492
+ "f":0.941332157
2493
  },
2494
  "obl":{
2495
+ "p":0.6901098901,
2496
+ "r":0.7024608501,
2497
+ "f":0.6962305987
2498
  },
2499
  "cc":{
2500
+ "p":0.7231695085,
2501
+ "r":0.7202797203,
2502
+ "f":0.7217217217
2503
  },
2504
  "conj":{
2505
+ "p":0.6972375691,
2506
+ "r":0.6741452991,
2507
+ "f":0.6854970125
2508
  },
2509
  "obl:agent":{
2510
+ "p":0.8571428571,
2511
+ "r":0.4864864865,
2512
+ "f":0.6206896552
2513
  },
2514
  "ccomp":{
2515
+ "p":0.5212264151,
2516
+ "r":0.5403422983,
2517
+ "f":0.5306122449
2518
  },
2519
  "nsubj:pass":{
2520
+ "p":0.7156862745,
2521
  "r":0.6822429907,
2522
+ "f":0.6985645933
2523
  },
2524
  "amod":{
2525
+ "p":0.606779661,
2526
+ "r":0.5128939828,
2527
+ "f":0.5559006211
2528
  },
2529
  "acl":{
2530
+ "p":0.4928057554,
2531
+ "r":0.4189602446,
2532
+ "f":0.452892562
2533
  },
2534
  "iobj":{
2535
+ "p":0.6984126984,
2536
+ "r":0.6959578207,
2537
+ "f":0.6971830986
2538
  },
2539
  "dep":{
2540
  "p":0.0,
 
2542
  "f":0.0
2543
  },
2544
  "nummod":{
2545
+ "p":0.6753246753,
2546
  "r":0.619047619,
2547
+ "f":0.6459627329
2548
  },
2549
  "vocative":{
2550
+ "p":0.7472527473,
2551
+ "r":0.7391304348,
2552
+ "f":0.7431693989
2553
  },
2554
  "orphan":{
2555
+ "p":0.2222222222,
2556
+ "r":0.1860465116,
2557
+ "f":0.2025316456
2558
  },
2559
  "appos":{
2560
+ "p":0.380952381,
2561
+ "r":0.3373493976,
2562
+ "f":0.357827476
2563
  },
2564
  "parataxis":{
2565
  "p":0.0,
2566
  "r":0.0,
2567
  "f":0.0
2568
  },
 
 
 
 
 
 
 
 
 
 
2569
  "dislocated":{
2570
+ "p":0.1428571429,
2571
+ "r":0.1153846154,
2572
+ "f":0.1276595745
2573
  },
2574
  "csubj:pass":{
2575
+ "p":0.2,
2576
  "r":0.2,
2577
+ "f":0.2
2578
  },
2579
  "flat:name":{
2580
+ "p":0.9285714286,
2581
  "r":0.5909090909,
2582
+ "f":0.7222222222
2583
+ },
2584
+ "fixed":{
2585
+ "p":0.1764705882,
2586
+ "r":0.6,
2587
+ "f":0.2727272727
2588
  },
2589
  "aux:pass":{
2590
  "p":0.0,
 
2595
  "p":0.0,
2596
  "r":0.0,
2597
  "f":0.0
2598
+ },
2599
+ "csubj":{
2600
+ "p":0.5517241379,
2601
+ "r":0.3106796117,
2602
+ "f":0.397515528
2603
  }
2604
  },
2605
+ "sents_p":0.8265213442,
2606
+ "sents_r":0.8441558442,
2607
+ "sents_f":0.8352455255,
2608
+ "lemma_acc":0.938530944,
2609
+ "transformer_loss":8751.1752120625,
2610
+ "tagger_loss":80.136901518,
2611
+ "morphologizer_loss":184.8931842226,
2612
+ "parser_loss":21934.3441275962,
2613
+ "trainable_lemmatizer_loss":348.3308892689
2614
  },
2615
  "requirements":[
2616
+ "spacy-transformers>=1.3.4,<1.4.0"
2617
  ]
2618
  }
morphologizer/cfg CHANGED
@@ -1,5 +1,6 @@
1
  {
2
  "extend":false,
 
3
  "labels_morph":{
4
  "Case=Gen|Gender=Masc|Number=Sing|POS=PROPN":"Case=Gen|Gender=Masc|Number=Sing",
5
  "Case=Gen|Gender=Masc|Number=Sing|POS=NOUN":"Case=Gen|Gender=Masc|Number=Sing",
 
1
  {
2
  "extend":false,
3
+ "label_smoothing":0.0,
4
  "labels_morph":{
5
  "Case=Gen|Gender=Masc|Number=Sing|POS=PROPN":"Case=Gen|Gender=Masc|Number=Sing",
6
  "Case=Gen|Gender=Masc|Number=Sing|POS=NOUN":"Case=Gen|Gender=Masc|Number=Sing",
morphologizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0eeff40f67250aacdbb0c93c0815becad2b1e400f71d06063e672a9994807576
3
  size 4408561
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4c78a510a6a0cf6831eab2239751edeb879214f8119e44b9fdbb73d1b259f39a
3
  size 4408561
parser/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e292087fce74cce8ddabe32dcf0e55ff588fcc5caa2b6e460c81aaf79a59ef58
3
  size 2075321
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:854e4dd7a628283a736e7d75da8d27eacda607f19fb49afebca3b6c27efcbeaf
3
  size 2075321
tagger/cfg CHANGED
@@ -1,4 +1,5 @@
1
  {
 
2
  "labels":[
3
  "---------",
4
  "--p---fa-",
 
1
  {
2
+ "label_smoothing":0.0,
3
  "labels":[
4
  "---------",
5
  "--p---fa-",
tagger/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:a0a79e1afe28e4f918648ba940fd9ba2d4cc72dc0e051237590a2f73e90a2af1
3
  size 2562961
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2b0d49b315c65ab8a7e79c19cea2bf0c9177f3f20ca59d231f0fec2406b3880d
3
  size 2562961
tokenizer CHANGED
@@ -1,4 +1,4 @@
1
- ��prefix_search� {^†|^⸏|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^〈|^〉|^⟦|^⟧|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^₠|^₡|^₢|^₣|^₤|^₥|^₦|^₧|^₨|^₩|^₪|^₫|^€|^₭|^₮|^₯|^₰|^₱|^₲|^₳|^₴|^₵|^₶|^₷|^₸|^₹|^₺|^₻|^₼|^₽|^₾|^₿|^[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]�suffix_search�
2
- �…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|〈$|〉$|⟦$|⟧$|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]$|†$|⸎$|(?<=[\u1F00-\u1FFF\u0370-\u03FF])[\-\.⸏]$�infix_finditer�?!\.\.+|…|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]|(?<=[0-9])[+\-\*^](?=[0-9-])|(?<=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉〈〉⟦⟧])\.(?=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉〈〉⟦⟧])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]),(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])(?:-|–|—|--|---|——|~)(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])[:<>=/](?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[\u1F00-\u1FFF\u0370-\u03FF])—�token_match��url_match�
3
  ��A�
4
  � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8)��A�8)�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�C++��A�C++�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�b.��A�b.�c.��A�c.�d.��A�d.�e.��A�e.�f.��A�f.�g.��A�g.�h.��A�h.�i.��A�i.�j.��A�j.�k.��A�k.�l.��A�l.�m.��A�m.�n.��A�n.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�p.��A�p.�q.��A�q.�r.��A�r.�s.��A�s.�t.��A�t.�u.��A�u.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ä.��A�ä.�ö.��A�ö.�ü.��A�ü.�Δ'��A�Δ'C�δέ�Δι'��A�Δι'C�διά�Δι’��A�Δι’C�διά�Δ’��A�Δ’C�δέ�Εφ'��A�Εφ'C�επί�Εφ’��A�Εφ’C�επί�Καθ'��A�Καθ'C�κατά�Καθ’��A�Καθ’C�κατά�Κατ'��A�Κατ'C�κατά�Κατ’��A�Κατ’C�κατά�Μ'��A�Μ'C�με�Μετ'��A�Μετ'C�μετά�Μετ’��A�Μετ’C�μετά�Μ’��A�Μ’C�με�Παρ'��A�Παρ'C�παρά�Παρ’��A�Παρ’C�παρά�Σ'��A�Σ'C�σε�Σ’��A�Σ’C�σε�Τ'��A�Τ'C�τε�Τ’��A�Τ’C�τε�αὑτός��A�αὑC�ὁ�A�τόςC�αὐτός�αὑτὸς��A�αὑC�ὁ�A�τὸςC�αὐτός�δ'��A�δ'C�δέ�δι'��A�δι'C�διά�διὰ��A�διὰC�διά�δι’��A�δι’C�διά�δὲ��A�δὲC�δέ�δ’��A�δ’C�δέ�εφ'��A�εφ'C�επί�εφ’��A�εφ’C�επί�θοἰμάτιον��A�θοC�τό�A�ἰμάτιον�θἡμέρᾳ��A�θC�τῇ�A�ἡμέρᾳ�καθ'��A�καθ'C�κατά�καθ’��A�καθ’C�κατά�κατ'��A�κατ'C�κατά�κατὰ��A�κατὰC�κατά�κατ’��A�κατ’C�κατά�καὐτός��A�κC�καί�A�αὐτός�καὐτὸς��A�κC�καί�A�αὐτὸςC�αὐτός�καὶ��A�καὶC�καί�κεἰ��A�κC�καί�A�εἰ�κεἰς��A�κC�καί�A�εἰς�κοὐ��A�κC�καί�A�οὐ�κἀγώ��A�κἀC�καί�A�γώC�ἐγώ�κἀγὼ��A�κἀC�καί�A�γὼC�ἐγώ�κἀν��A�κC�καί�A�ἀνC�ἐν�κἀς��A�κC�καί�A�ἀςC�ἐς�κᾆτα��A�κC�καί�A�ᾆταC�εἶτα�μ'��A�μ'C�με�μέ��A�μέC�με�μεθ'��A�μεθ'C�μετά�μεθ’��A�μεθ’C�μετά�μετ'��A�μετ'C�μετά�μετὰ��A�μετὰC�μετά�μετ’��A�μετ’C�μετά�μοὔστι��A�μοὔC�μοί�A�στιC�ἐστι�μοὖστι��A�μοὖC�μοί�A�στιC�ἐστι�μὲ��A�μὲC�με�μὲν��A�μὲνC�μέν�μὴν��A�μὴνC�μήν�μ’��A�μ’C�με�οὑμοί��A�οὑC�οἱ�A�μοίC�ἐμoί�οὑμοὶ��A�οὑC�οἱ�A�μοὶC�ἐμoί�οὑμός��A�οὑC�ὁ�A�μόςC�ἐμός�οὑμὸς��A�οὑC�ὁ�A�μὸςC�ἐμός�οὑν��A�οὑC�ὁ�A�νC�ἐν�παρ��A�παρC�παρά�παρ'��A�παρ'C�παρά�παρὰ��A�παρὰC�παρά�παρ’��A�παρ’C�παρά�προὔχοντα��A�προὔC�πρό�A�χονταC�ἔχοντα�προὔχων��A�προὔC�πρό�A�χωνC�ἔχων�σ'��A�σ'C�σε�σέ��A�σέC�σε�σοὐστί��A�σοὐC�σοί�A�στίC�ἐστί�σοὐστὶ��A�σοὐC�σοί�A�στὶC�ἐστί�σοὔστι��A�σοὔC�σοί�A�στιC�ἐστι�σὲ��A�σὲC�σε�σ’��A�σ’C�σε�τ'��A�τ'C�τε�τέ��A�τέC�τε�ταὐτοῦ��A�τC�τοῦ�A�αὐτοῦ�τοὔνομα��A�τοὔC�τό�A�νομαC�ὄνομα�τἀνδρί��A�τC�τῷ�A�ἀνδρί�τἀνδρός��A�τC�τοῦ�A�ἀνδρός�τἀνδρὶ��A�τC�τῷ�A�ἀνδρὶC�ἀνδρί�τἀνδρὸς��A�τC�τοῦ�A�ἀνδρὸςC�ἀνδρός�τἄλλα��A�τC�τὰ�A�ἄλλα�τἆλλα��A�τἆC�τὰ�A�λλαC�ἄλλα�τὠληθές��A�τὠC�τὸ�A�ληθέςC�ἀληθές�τὲ��A�τὲC�τε�τὴν��A�τὴνC�τήν�τὸν��A�τὸνC�τόν�τ’��A�τ’C�τε�χοἱ��A�χC�καί�A�οἱ�χἡ��A�χC�καί�A�ἡ�χἱκετεύετε��A�χC�καί�A�ἱκετεύετε�χὤπως��A�χC�καί�A�ὤπωςC�ὅπως�χὤταν��A�χC�καί�A�ὤτανC�ὅταν�χὤτε��A�χC�καί�A�ὤτεC�ὅτε�χὤτι��A�χC�καί�A�ὤτιC�ὅτι�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�ἀλλ'��A�ἀλλ'C�ἀλλά�ἀλλὰ��A�ἀλλὰC�ἀλλά�ἀλλ’��A�ἀλλ’C�ἀλλά�ἀπὸ��A�ἀπὸC�από�ἀφ'��A�ἀφ'C�από�ἀφ’��A�ἀφ’C�από�ἁγαθαί��A�ἁC�αἱ�A�γαθαίC�ἀγαθαί�ἁγαθαὶ��A�ἁC�αἱ�A�γαθαὶC�ἀγαθαί�ἁγώ��A�ἁC�ἃ�A�γώC�ἐγώ�ἁγὼ��A�ἁC�ἃ�A�γὼC�ἐγώ�ἁλήθεια��A�ἁC�ἡ�A�λήθειαC�ἀλήθεια�ἁνήρ��A�ἁC�ὁ�A�νήρC�ἀνήρ�ἁνὴρ��A�ἁC�ὁ�A�νὴρC�ἀνήρ�ἅνδρες��A�ἅC�οἱ�A�νδρεςC�ἄνδρες�ἅνθρωπος��A�ἅC�ὁ�A�νθρωποςC�ἄνθρωπος�ἐγᾦδα��A�ἐγC�ἐγώ�A�ᾦδαC�οἶδα�ἐγᾦμαι��A�ἐγC�ἐγώ�A�ᾦμαιC�οἶμαι�ἐπ'��A�ἐπ'C�επί�ἐπὶ��A�ἐπὶC�επί�ἐπ’��A�ἐπ’C�επί�Ἐπ'��A�Ἐπ'C�επί�Ἐπ’��A�Ἐπ’C�επί�ὑπ'��A�ὑπ'C�ὑπό�ὑπ’��A�ὑπ’C�ὑπό�ὑφ'��A�ὑφ'C�ὑπό�ὑφ’��A�ὑφ’C�ὑπό�Ὑπ'��A�Ὑπ'C�ὑπό�Ὑπ’��A�Ὑπ’C�ὑπό�ὥνεκα��A�ὥC�οὗ�A�νεκαC�ἕνεκα�ὦνδρες��A�ὦC�ὦ�A�νδρεςC�ἄνδρες�ὦνερ��A�ὦC�ὦ�A�νερC�ἄνερ�᾽ΑΠ'��A�᾽ΑΠ'C�από�᾽ΑΠ’��A�᾽ΑΠ’C�από�᾽Αλλ'��A�᾽Αλλ'C�ἀλλά�᾽Αλλ’��A�᾽Αλλ’C�ἀλλά�᾽Απ'��A�᾽Απ'C�από�᾽Απ’��A�᾽Απ’C�από�᾽Αφ��A�᾽ΑφC�από�—��A�—�’��A�’�’’��A�’’�faster_heuristics�
 
1
+ ��prefix_search� �^†|^⸏|^〈|^…|^……|^,|^:|^;|^\!|^\?|^¿|^؟|^¡|^\(|^\)|^\[|^\]|^\{|^\}|^<|^>|^_|^#|^\*|^&|^。|^?|^!|^,|^、|^;|^:|^~|^·|^।|^،|^۔|^؛|^٪|^\.\.+|^…|^\'|^"|^”|^“|^`|^‘|^´|^’|^‚|^,|^„|^»|^«|^「|^」|^『|^』|^(|^)|^〔|^〕|^【|^】|^《|^》|^〈|^〉|^〈|^〉|^⟦|^⟧|^\$|^£|^€|^¥|^฿|^US\$|^C\$|^A\$|^₽|^﷼|^₴|^₠|^₡|^₢|^₣|^₤|^₥|^₦|^₧|^₨|^₩|^₪|^₫|^€|^₭|^₮|^₯|^₰|^₱|^₲|^₳|^₴|^₵|^₶|^₷|^₸|^₹|^₺|^₻|^₼|^₽|^₾|^₿|^[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]�suffix_search�
2
+ �…$|……$|,$|:$|;$|\!$|\?$|¿$|؟$|¡$|\($|\)$|\[$|\]$|\{$|\}$|<$|>$|_$|#$|\*$|&$|。$|?$|!$|,$|、$|;$|:$|~$|·$|।$|،$|۔$|؛$|٪$|\.\.+$|…$|\'$|"$|”$|“$|`$|‘$|´$|’$|‚$|,$|„$|»$|«$|「$|」$|『$|』$|($|)$|〔$|〕$|【$|】$|《$|》$|〈$|〉$|〈$|〉$|⟦$|⟧$|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]$|†$|⸎$|〉$|(?<=[\u1F00-\u1FFF\u0370-\u03FF])[\-\.⸏]$�infix_finditer�?!\.\.+|…|[\u00A6\u00A9\u00AE\u00B0\u0482\u058D\u058E\u060E\u060F\u06DE\u06E9\u06FD\u06FE\u07F6\u09FA\u0B70\u0BF3-\u0BF8\u0BFA\u0C7F\u0D4F\u0D79\u0F01-\u0F03\u0F13\u0F15-\u0F17\u0F1A-\u0F1F\u0F34\u0F36\u0F38\u0FBE-\u0FC5\u0FC7-\u0FCC\u0FCE\u0FCF\u0FD5-\u0FD8\u109E\u109F\u1390-\u1399\u1940\u19DE-\u19FF\u1B61-\u1B6A\u1B74-\u1B7C\u2100\u2101\u2103-\u2106\u2108\u2109\u2114\u2116\u2117\u211E-\u2123\u2125\u2127\u2129\u212E\u213A\u213B\u214A\u214C\u214D\u214F\u218A\u218B\u2195-\u2199\u219C-\u219F\u21A1\u21A2\u21A4\u21A5\u21A7-\u21AD\u21AF-\u21CD\u21D0\u21D1\u21D3\u21D5-\u21F3\u2300-\u2307\u230C-\u231F\u2322-\u2328\u232B-\u237B\u237D-\u239A\u23B4-\u23DB\u23E2-\u2426\u2440-\u244A\u249C-\u24E9\u2500-\u25B6\u25B8-\u25C0\u25C2-\u25F7\u2600-\u266E\u2670-\u2767\u2794-\u27BF\u2800-\u28FF\u2B00-\u2B2F\u2B45\u2B46\u2B4D-\u2B73\u2B76-\u2B95\u2B98-\u2BC8\u2BCA-\u2BFE\u2CE5-\u2CEA\u2E80-\u2E99\u2E9B-\u2EF3\u2F00-\u2FD5\u2FF0-\u2FFB\u3004\u3012\u3013\u3020\u3036\u3037\u303E\u303F\u3190\u3191\u3196-\u319F\u31C0-\u31E3\u3200-\u321E\u322A-\u3247\u3250\u3260-\u327F\u328A-\u32B0\u32C0-\u32FE\u3300-\u33FF\u4DC0-\u4DFF\uA490-\uA4C6\uA828-\uA82B\uA836\uA837\uA839\uAA77-\uAA79\uFDFD\uFFE4\uFFE8\uFFED\uFFEE\uFFFC\uFFFD\U00010137-\U0001013F\U00010179-\U00010189\U0001018C-\U0001018E\U00010190-\U0001019B\U000101A0\U000101D0-\U000101FC\U00010877\U00010878\U00010AC8\U0001173F\U00016B3C-\U00016B3F\U00016B45\U0001BC9C\U0001D000-\U0001D0F5\U0001D100-\U0001D126\U0001D129-\U0001D164\U0001D16A-\U0001D16C\U0001D183\U0001D184\U0001D18C-\U0001D1A9\U0001D1AE-\U0001D1E8\U0001D200-\U0001D241\U0001D245\U0001D300-\U0001D356\U0001D800-\U0001D9FF\U0001DA37-\U0001DA3A\U0001DA6D-\U0001DA74\U0001DA76-\U0001DA83\U0001DA85\U0001DA86\U0001ECAC\U0001F000-\U0001F02B\U0001F030-\U0001F093\U0001F0A0-\U0001F0AE\U0001F0B1-\U0001F0BF\U0001F0C1-\U0001F0CF\U0001F0D1-\U0001F0F5\U0001F110-\U0001F16B\U0001F170-\U0001F1AC\U0001F1E6-\U0001F202\U0001F210-\U0001F23B\U0001F240-\U0001F248\U0001F250\U0001F251\U0001F260-\U0001F265\U0001F300-\U0001F3FA\U0001F400-\U0001F6D4\U0001F6E0-\U0001F6EC\U0001F6F0-\U0001F6F9\U0001F700-\U0001F773\U0001F780-\U0001F7D8\U0001F800-\U0001F80B\U0001F810-\U0001F847\U0001F850-\U0001F859\U0001F860-\U0001F887\U0001F890-\U0001F8AD\U0001F900-\U0001F90B\U0001F910-\U0001F93E\U0001F940-\U0001F970\U0001F973-\U0001F976\U0001F97A\U0001F97C-\U0001F9A2\U0001F9B0-\U0001F9B9\U0001F9C0-\U0001F9C2\U0001F9D0-\U0001F9FF\U0001FA60-\U0001FA6D]|(?<=[0-9])[+\-\*^](?=[0-9-])|(?<=[a-z\uFF41-\uFF5A\u00DF-\u00F6\u00F8-\u00FF\u0101\u0103\u0105\u0107\u0109\u010B\u010D\u010F\u0111\u0113\u0115\u0117\u0119\u011B\u011D\u011F\u0121\u0123\u0125\u0127\u0129\u012B\u012D\u012F\u0131\u0133\u0135\u0137\u0138\u013A\u013C\u013E\u0140\u0142\u0144\u0146\u0148\u0149\u014B\u014D\u014F\u0151\u0153\u0155\u0157\u0159\u015B\u015D\u015F\u0161\u0163\u0165\u0167\u0169\u016B\u016D\u016F\u0171\u0173\u0175\u0177\u017A\u017C\u017E\u017F\u0180\u0183\u0185\u0188\u018C\u018D\u0192\u0195\u0199-\u019B\u019E\u01A1\u01A3\u01A5\u01A8\u01AA\u01AB\u01AD\u01B0\u01B4\u01B6\u01B9\u01BA\u01BD-\u01BF\u01C6\u01C9\u01CC\u01CE\u01D0\u01D2\u01D4\u01D6\u01D8\u01DA\u01DC\u01DD\u01DF\u01E1\u01E3\u01E5\u01E7\u01E9\u01EB\u01ED\u01EF\u01F0\u01F3\u01F5\u01F9\u01FB\u01FD\u01FF\u0201\u0203\u0205\u0207\u0209\u020B\u020D\u020F\u0211\u0213\u0215\u0217\u0219\u021B\u021D\u021F\u0221\u0223\u0225\u0227\u0229\u022B\u022D\u022F\u0231\u0233-\u0239\u023C\u023F\u0240\u0242\u0247\u0249\u024B\u024D\u024F\u2C61\u2C65\u2C66\u2C68\u2C6A\u2C6C\u2C71\u2C73\u2C74\u2C76-\u2C7B\uA723\uA725\uA727\uA729\uA72B\uA72D\uA72F-\uA731\uA733\uA735\uA737\uA739\uA73B\uA73D\uA73F\uA741\uA743\uA745\uA747\uA749\uA74B\uA74D\uA74F\uA751\uA753\uA755\uA757\uA759\uA75B\uA75D\uA75F\uA761\uA763\uA765\uA767\uA769\uA76B\uA76D\uA76F\uA771-\uA778\uA77A\uA77C\uA77F\uA781\uA783\uA785\uA787\uA78C\uA78E\uA791\uA793-\uA795\uA797\uA799\uA79B\uA79D\uA79F\uA7A1\uA7A3\uA7A5\uA7A7\uA7A9\uA7AF\uA7B5\uA7B7\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E01\u1E03\u1E05\u1E07\u1E09\u1E0B\u1E0D\u1E0F\u1E11\u1E13\u1E15\u1E17\u1E19\u1E1B\u1E1D\u1E1F\u1E21\u1E23\u1E25\u1E27\u1E29\u1E2B\u1E2D\u1E2F\u1E31\u1E33\u1E35\u1E37\u1E39\u1E3B\u1E3D\u1E3F\u1E41\u1E43\u1E45\u1E47\u1E49\u1E4B\u1E4D\u1E4F\u1E51\u1E53\u1E55\u1E57\u1E59\u1E5B\u1E5D\u1E5F\u1E61\u1E63\u1E65\u1E67\u1E69\u1E6B\u1E6D\u1E6F\u1E71\u1E73\u1E75\u1E77\u1E79\u1E7B\u1E7D\u1E7F\u1E81\u1E83\u1E85\u1E87\u1E89\u1E8B\u1E8D\u1E8F\u1E91\u1E93\u1E95-\u1E9D\u1E9F\u1EA1\u1EA3\u1EA5\u1EA7\u1EA9\u1EAB\u1EAD\u1EAF\u1EB1\u1EB3\u1EB5\u1EB7\u1EB9\u1EBB\u1EBD\u1EBF\u1EC1\u1EC3\u1EC5\u1EC7\u1EC9\u1ECB\u1ECD\u1ECF\u1ED1\u1ED3\u1ED5\u1ED7\u1ED9\u1EDB\u1EDD\u1EDF\u1EE1\u1EE3\u1EE5\u1EE7\u1EE9\u1EEB\u1EED\u1EEF\u1EF1\u1EF3\u1EF5\u1EF7\u1EF9\u1EFB\u1EFD\u1EFFёа-яәөүҗңһα-ωάέίόώήύа-щюяіїєґѓѕјљњќѐѝ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉〈〉⟦⟧])\.(?=[A-Z\uFF21-\uFF3A\u00C0-\u00D6\u00D8-\u00DE\u0100\u0102\u0104\u0106\u0108\u010A\u010C\u010E\u0110\u0112\u0114\u0116\u0118\u011A\u011C\u011E\u0120\u0122\u0124\u0126\u0128\u012A\u012C\u012E\u0130\u0132\u0134\u0136\u0139\u013B\u013D\u013F\u0141\u0143\u0145\u0147\u014A\u014C\u014E\u0150\u0152\u0154\u0156\u0158\u015A\u015C\u015E\u0160\u0162\u0164\u0166\u0168\u016A\u016C\u016E\u0170\u0172\u0174\u0176\u0178\u0179\u017B\u017D\u0181\u0182\u0184\u0186\u0187\u0189-\u018B\u018E-\u0191\u0193\u0194\u0196-\u0198\u019C\u019D\u019F\u01A0\u01A2\u01A4\u01A6\u01A7\u01A9\u01AC\u01AE\u01AF\u01B1-\u01B3\u01B5\u01B7\u01B8\u01BC\u01C4\u01C7\u01CA\u01CD\u01CF\u01D1\u01D3\u01D5\u01D7\u01D9\u01DB\u01DE\u01E0\u01E2\u01E4\u01E6\u01E8\u01EA\u01EC\u01EE\u01F1\u01F4\u01F6-\u01F8\u01FA\u01FC\u01FE\u0200\u0202\u0204\u0206\u0208\u020A\u020C\u020E\u0210\u0212\u0214\u0216\u0218\u021A\u021C\u021E\u0220\u0222\u0224\u0226\u0228\u022A\u022C\u022E\u0230\u0232\u023A\u023B\u023D\u023E\u0241\u0243-\u0246\u0248\u024A\u024C\u024E\u2C60\u2C62-\u2C64\u2C67\u2C69\u2C6B\u2C6D-\u2C70\u2C72\u2C75\u2C7E\u2C7F\uA722\uA724\uA726\uA728\uA72A\uA72C\uA72E\uA732\uA734\uA736\uA738\uA73A\uA73C\uA73E\uA740\uA742\uA744\uA746\uA748\uA74A\uA74C\uA74E\uA750\uA752\uA754\uA756\uA758\uA75A\uA75C\uA75E\uA760\uA762\uA764\uA766\uA768\uA76A\uA76C\uA76E\uA779\uA77B\uA77D\uA77E\uA780\uA782\uA784\uA786\uA78B\uA78D\uA790\uA792\uA796\uA798\uA79A\uA79C\uA79E\uA7A0\uA7A2\uA7A4\uA7A6\uA7A8\uA7AA-\uA7AE\uA7B0-\uA7B4\uA7B6\uA7B8\u1E00\u1E02\u1E04\u1E06\u1E08\u1E0A\u1E0C\u1E0E\u1E10\u1E12\u1E14\u1E16\u1E18\u1E1A\u1E1C\u1E1E\u1E20\u1E22\u1E24\u1E26\u1E28\u1E2A\u1E2C\u1E2E\u1E30\u1E32\u1E34\u1E36\u1E38\u1E3A\u1E3C\u1E3E\u1E40\u1E42\u1E44\u1E46\u1E48\u1E4A\u1E4C\u1E4E\u1E50\u1E52\u1E54\u1E56\u1E58\u1E5A\u1E5C\u1E5E\u1E60\u1E62\u1E64\u1E66\u1E68\u1E6A\u1E6C\u1E6E\u1E70\u1E72\u1E74\u1E76\u1E78\u1E7A\u1E7C\u1E7E\u1E80\u1E82\u1E84\u1E86\u1E88\u1E8A\u1E8C\u1E8E\u1E90\u1E92\u1E94\u1E9E\u1EA0\u1EA2\u1EA4\u1EA6\u1EA8\u1EAA\u1EAC\u1EAE\u1EB0\u1EB2\u1EB4\u1EB6\u1EB8\u1EBA\u1EBC\u1EBE\u1EC0\u1EC2\u1EC4\u1EC6\u1EC8\u1ECA\u1ECC\u1ECE\u1ED0\u1ED2\u1ED4\u1ED6\u1ED8\u1EDA\u1EDC\u1EDE\u1EE0\u1EE2\u1EE4\u1EE6\u1EE8\u1EEA\u1EEC\u1EEE\u1EF0\u1EF2\u1EF4\u1EF6\u1EF8\u1EFA\u1EFC\u1EFEЁА-ЯӘӨҮҖҢҺΑ-ΩΆΈΊΌΏΉΎА-ЩЮЯІЇЄҐЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F\'"”“`‘´’‚,„»«「」『』()〔〕【】《》〈〉〈〉⟦⟧])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F]),(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])(?:-|–|—|--|---|——|~)(?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F0-9])[:<>=/](?=[A-Za-z\uFF21-\uFF3A\uFF41-\uFF5A\u00C0-\u00D6\u00D8-\u00F6\u00F8-\u00FF\u0100-\u017F\u0180-\u01BF\u01C4-\u024F\u2C60-\u2C7B\u2C7E\u2C7F\uA722-\uA76F\uA771-\uA787\uA78B-\uA78E\uA790-\uA7B9\uA7FA\uAB30-\uAB5A\uAB60-\uAB64\u0250-\u02AF\u1D00-\u1D25\u1D6B-\u1D77\u1D79-\u1D9A\u1E00-\u1EFFёа-яЁА-ЯәөүҗңһӘӨҮҖҢҺα-ωάέίόώήύΑ-ΩΆΈΊΌΏΉΎа-щюяіїєґА-ЩЮЯІЇЄҐѓѕјљњќѐѝЃЅЈЉЊЌЀЍ\u1200-\u137F\u0980-\u09FF\u0591-\u05F4\uFB1D-\uFB4F\u0620-\u064A\u066E-\u06D5\u06E5-\u06FF\u0750-\u077F\u08A0-\u08BD\uFB50-\uFBB1\uFBD3-\uFD3D\uFD50-\uFDC7\uFDF0-\uFDFB\uFE70-\uFEFC\U0001EE00-\U0001EEBB\u0D80-\u0DFF\u0900-\u097F\u0C80-\u0CFF\u0B80-\u0BFF\u0C00-\u0C7F\uAC00-\uD7AF\u1100-\u11FF\u3040-\u309F\u30A0-\u30FFー\u4E00-\u62FF\u6300-\u77FF\u7800-\u8CFF\u8D00-\u9FFF\u3400-\u4DBF\U00020000-\U000215FF\U00021600-\U000230FF\U00023100-\U000245FF\U00024600-\U000260FF\U00026100-\U000275FF\U00027600-\U000290FF\U00029100-\U0002A6DF\U0002A700-\U0002B73F\U0002B740-\U0002B81F\U0002B820-\U0002CEAF\U0002CEB0-\U0002EBEF\u2E80-\u2EFF\u2F00-\u2FDF\u2FF0-\u2FFF\u3000-\u303F\u31C0-\u31EF\u3200-\u32FF\u3300-\u33FF\uF900-\uFAFF\uFE30-\uFE4F\U0001F200-\U0001F2FF\U0002F800-\U0002FA1F])|(?<=[\u1F00-\u1FFF\u0370-\u03FF])—�token_match��url_match�
3
  ��A�
4
  � ��A� �'��A�'�''��A�''�(*_*)��A�(*_*)�(-8��A�(-8�(-:��A�(-:�(-;��A�(-;�(-_-)��A�(-_-)�(._.)��A�(._.)�(:��A�(:�(;��A�(;�(=��A�(=�(>_<)��A�(>_<)�(^_^)��A�(^_^)�(o:��A�(o:�(¬_¬)��A�(¬_¬)�(ಠ_ಠ)��A�(ಠ_ಠ)�(╯°□°)╯︵┻━┻��A�(╯°□°)╯︵┻━┻�)-:��A�)-:�):��A�):�-_-��A�-_-�-__-��A�-__-�._.��A�._.�0.0��A�0.0�0.o��A�0.o�0_0��A�0_0�0_o��A�0_o�8)��A�8)�8-)��A�8-)�8-D��A�8-D�8D��A�8D�:'(��A�:'(�:')��A�:')�:'-(��A�:'-(�:'-)��A�:'-)�:(��A�:(�:((��A�:((�:(((��A�:(((�:()��A�:()�:)��A�:)�:))��A�:))�:)))��A�:)))�:*��A�:*�:-(��A�:-(�:-((��A�:-((�:-(((��A�:-(((�:-)��A�:-)�:-))��A�:-))�:-)))��A�:-)))�:-*��A�:-*�:-/��A�:-/�:-0��A�:-0�:-3��A�:-3�:->��A�:->�:-D��A�:-D�:-O��A�:-O�:-P��A�:-P�:-X��A�:-X�:-]��A�:-]�:-o��A�:-o�:-p��A�:-p�:-x��A�:-x�:-|��A�:-|�:-}��A�:-}�:/��A�:/�:0��A�:0�:1��A�:1�:3��A�:3�:>��A�:>�:D��A�:D�:O��A�:O�:P��A�:P�:X��A�:X�:]��A�:]�:o��A�:o�:o)��A�:o)�:p��A�:p�:x��A�:x�:|��A�:|�:}��A�:}�:’(��A�:’(�:’)��A�:’)�:’-(��A�:’-(�:’-)��A�:’-)�;)��A�;)�;-)��A�;-)�;-D��A�;-D�;D��A�;D�;_;��A�;_;�<.<��A�<.<�</3��A�</3�<3��A�<3�<33��A�<33�<333��A�<333�<space>��A�<space>�=(��A�=(�=)��A�=)�=/��A�=/�=3��A�=3�=D��A�=D�=[��A�=[�=]��A�=]�=|��A�=|�>.<��A�>.<�>.>��A�>.>�>:(��A�>:(�>:o��A�>:o�><(((*>��A�><(((*>�@_@��A�@_@�C++��A�C++�O.O��A�O.O�O.o��A�O.o�O_O��A�O_O�O_o��A�O_o�V.V��A�V.V�V_V��A�V_V�XD��A�XD�XDD��A�XDD�[-:��A�[-:�[:��A�[:�[=��A�[=�\")��A�\")�\n��A�\n�\t��A�\t�]=��A�]=�^_^��A�^_^�^__^��A�^__^�^___^��A�^___^�a.��A�a.�b.��A�b.�c.��A�c.�d.��A�d.�e.��A�e.�f.��A�f.�g.��A�g.�h.��A�h.�i.��A�i.�j.��A�j.�k.��A�k.�l.��A�l.�m.��A�m.�n.��A�n.�o.��A�o.�o.0��A�o.0�o.O��A�o.O�o.o��A�o.o�o_0��A�o_0�o_O��A�o_O�o_o��A�o_o�p.��A�p.�q.��A�q.�r.��A�r.�s.��A�s.�t.��A�t.�u.��A�u.�v.��A�v.�v.v��A�v.v�v_v��A�v_v�w.��A�w.�x.��A�x.�xD��A�xD�xDD��A�xDD�y.��A�y.�z.��A�z.� ��A� C� �¯\(ツ)/¯��A�¯\(ツ)/¯�°C.��A�°�A�C�A�.�°F.��A�°�A�F�A�.�°K.��A�°�A�K�A�.�°c.��A�°�A�c�A�.�°f.��A�°�A�f�A�.�°k.��A�°�A�k�A�.�ä.��A�ä.�ö.��A�ö.�ü.��A�ü.�Δ'��A�Δ'C�δέ�Δι'��A�Δι'C�διά�Δι’��A�Δι’C�διά�Δ’��A�Δ’C�δέ�Εφ'��A�Εφ'C�επί�Εφ’��A�Εφ’C�επί�Καθ'��A�Καθ'C�κατά�Καθ’��A�Καθ’C�κατά�Κατ'��A�Κατ'C�κατά�Κατ’��A�Κατ’C�κατά�Μ'��A�Μ'C�με�Μετ'��A�Μετ'C�μετά�Μετ’��A�Μετ’C�μετά�Μ’��A�Μ’C�με�Παρ'��A�Παρ'C�παρά�Παρ’��A�Παρ’C�παρά�Σ'��A�Σ'C�σε�Σ’��A�Σ’C�σε�Τ'��A�Τ'C�τε�Τ’��A�Τ’C�τε�αὑτός��A�αὑC�ὁ�A�τόςC�αὐτός�αὑτὸς��A�αὑC�ὁ�A�τὸςC�αὐτός�δ'��A�δ'C�δέ�δι'��A�δι'C�διά�διὰ��A�διὰC�διά�δι’��A�δι’C�διά�δὲ��A�δὲC�δέ�δ’��A�δ’C�δέ�εφ'��A�εφ'C�επί�εφ’��A�εφ’C�επί�θοἰμάτιον��A�θοC�τό�A�ἰμάτιον�θἡμέρᾳ��A�θC�τῇ�A�ἡμέρᾳ�καθ'��A�καθ'C�κατά�καθ’��A�καθ’C�κατά�κατ'��A�κατ'C�κατά�κατὰ��A�κατὰC�κατά�κατ’��A�κατ’C�κατά�καὐτός��A�κC�καί�A�αὐτός�καὐτὸς��A�κC�καί�A�αὐτὸςC�αὐτός�καὶ��A�καὶC�καί�κεἰ��A�κC�καί�A�εἰ�κεἰς��A�κC�καί�A�εἰς�κοὐ��A�κC�καί�A�οὐ�κἀγώ��A�κἀC�καί�A�γώC�ἐγώ�κἀγὼ��A�κἀC�καί�A�γὼC�ἐγώ�κἀν��A�κC�καί�A�ἀνC�ἐν�κἀς��A�κC�καί�A�ἀςC�ἐς�κᾆτα��A�κC�καί�A�ᾆταC�εἶτα�μ'��A�μ'C�με�μέ��A�μέC�με�μεθ'��A�μεθ'C�μετά�μεθ’��A�μεθ’C�μετά�μετ'��A�μετ'C�μετά�μετὰ��A�μετὰC�μετά�μετ’��A�μετ’C�μετά�μοὔστι��A�μοὔC�μοί�A�στιC�ἐστι�μοὖστι��A�μοὖC�μοί�A�στιC�ἐστι�μὲ��A�μὲC�με�μὲν��A�μὲνC�μέν�μὴν��A�μὴνC�μήν�μ’��A�μ’C�με�οὑμοί��A�οὑC�οἱ�A�μοίC�ἐμoί�οὑμοὶ��A�οὑC�οἱ�A�μοὶC�ἐμoί�οὑμός��A�οὑC�ὁ�A�μόςC�ἐμός�οὑμὸς��A�οὑC�ὁ�A�μὸςC�ἐμός�οὑν��A�οὑC�ὁ�A�νC�ἐν�παρ��A�παρC�παρά�παρ'��A�παρ'C�παρά�παρὰ��A�παρὰC�παρά�παρ’��A�παρ’C�παρά�προὔχοντα��A�προὔC�πρό�A�χονταC�ἔχοντα�προὔχων��A�προὔC�πρό�A�χωνC�ἔχων�σ'��A�σ'C�σε�σέ��A�σέC�σε�σοὐστί��A�σοὐC�σοί�A�στίC�ἐστί�σοὐστὶ��A�σοὐC�σοί�A�στὶC�ἐστί�σοὔστι��A�σοὔC�σοί�A�στιC�ἐστι�σὲ��A�σὲC�σε�σ’��A�σ’C�σε�τ'��A�τ'C�τε�τέ��A�τέC�τε�ταὐτοῦ��A�τC�τοῦ�A�αὐτοῦ�τοὔνομα��A�τοὔC�τό�A�νομαC�ὄνομα�τἀνδρί��A�τC�τῷ�A�ἀνδρί�τἀνδρός��A�τC�τοῦ�A�ἀνδρός�τἀνδρὶ��A�τC�τῷ�A�ἀνδρὶC�ἀνδρί�τἀνδρὸς��A�τC�τοῦ�A�ἀνδρὸςC�ἀνδρός�τἄλλα��A�τC�τὰ�A�ἄλλα�τἆλλα��A�τἆC�τὰ�A�λλαC�ἄλλα�τὠληθές��A�τὠC�τὸ�A�ληθέςC�ἀληθές�τὲ��A�τὲC�τε�τὴν��A�τὴνC�τήν�τὸν��A�τὸνC�τόν�τ’��A�τ’C�τε�χοἱ��A�χC�καί�A�οἱ�χἡ��A�χC�καί�A�ἡ�χἱκετεύετε��A�χC�καί�A�ἱκετεύετε�χὤπως��A�χC�καί�A�ὤπωςC�ὅπως�χὤταν��A�χC�καί�A�ὤτανC�ὅταν�χὤτε��A�χC�καί�A�ὤτεC�ὅτε�χὤτι��A�χC�καί�A�ὤτιC�ὅτι�ಠ_ಠ��A�ಠ_ಠ�ಠ︵ಠ��A�ಠ︵ಠ�ἀλλ'��A�ἀλλ'C�ἀλλά�ἀλλὰ��A�ἀλλὰC�ἀλλά�ἀλλ’��A�ἀλλ’C�ἀλλά�ἀπὸ��A�ἀπὸC�από�ἀφ'��A�ἀφ'C�από�ἀφ’��A�ἀφ’C�από�ἁγαθαί��A�ἁC�αἱ�A�γαθαίC�ἀγαθαί�ἁγαθαὶ��A�ἁC�αἱ�A�γαθαὶC�ἀγαθαί�ἁγώ��A�ἁC�ἃ�A�γώC�ἐγώ�ἁγὼ��A�ἁC�ἃ�A�γὼC�ἐγώ�ἁλήθεια��A�ἁC�ἡ�A�λήθειαC�ἀλήθεια�ἁνήρ��A�ἁC�ὁ�A�νήρC�ἀνήρ�ἁνὴρ��A�ἁC�ὁ�A�νὴρC�ἀνήρ�ἅνδρες��A�ἅC�οἱ�A�νδρεςC�ἄνδρες�ἅνθρωπος��A�ἅC�ὁ�A�νθρωποςC�ἄνθρωπος�ἐγᾦδα��A�ἐγC�ἐγώ�A�ᾦδαC�οἶδα�ἐγᾦμαι��A�ἐγC�ἐγώ�A�ᾦμαιC�οἶμαι�ἐπ'��A�ἐπ'C�επί�ἐπὶ��A�ἐπὶC�επί�ἐπ’��A�ἐπ’C�επί�Ἐπ'��A�Ἐπ'C�επί�Ἐπ’��A�Ἐπ’C�επί�ὑπ'��A�ὑπ'C�ὑπό�ὑπ’��A�ὑπ’C�ὑπό�ὑφ'��A�ὑφ'C�ὑπό�ὑφ’��A�ὑφ’C�ὑπό�Ὑπ'��A�Ὑπ'C�ὑπό�Ὑπ’��A�Ὑπ’C�ὑπό�ὥνεκα��A�ὥC�οὗ�A�νεκαC�ἕνεκα�ὦνδρες��A�ὦC�ὦ�A�νδρεςC�ἄνδρες�ὦνερ��A�ὦC�ὦ�A�νερC�ἄνερ�᾽ΑΠ'��A�᾽ΑΠ'C�από�᾽ΑΠ’��A�᾽ΑΠ’C�από�᾽Αλλ'��A�᾽Αλλ'C�ἀλλά�᾽Αλλ’��A�᾽Αλλ’C�ἀλλά�᾽Απ'��A�᾽Απ'C�από�᾽Απ’��A�᾽Απ’C�από�᾽Αφ��A�᾽ΑφC�από�—��A�—�’��A�’�’’��A�’’�faster_heuristics�
trainable_lemmatizer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:daa940742b9dc6d624ec6f0be5c6b4dd75aaea5ad4704727d568b3408cd543ef
3
  size 71957523
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7f4453cd08855bb9e778c616b004de2d0d62633e3426b9d3c2fff7310d169e22
3
  size 71957523
transformer/model CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:93121f678bdeaf358a029839af586fdcc837c37bde94b38acd80c7dff23022a6
3
- size 453378980
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:94efe0e07cb1f29afb0fe4645b0bb4eee21815a1f773c638b1ff80cb5df65516
3
+ size 453378924
vocab/strings.json CHANGED
The diff for this file is too large to render. See raw diff