data example
README.md
CHANGED
@@ -19,6 +19,10 @@ the data can be prepared like this:
 the broken_text is used as input, while the text is the output
 ```python
 
+import re
+import phonetics
+import random
+
 chars_to_ignore_regex = "[^A-Za-z0-9\ö\ä\ü\Ö\Ä\Ü\ß\-,;.:?! ]+"
 broken_chars_to_ignore_regex = "[^A-Za-z0-9\ö\ä\ü\Ö\Ä\Ü\ß\- ]+"
 
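A minimal sketch of how the two patterns from this hunk might produce one `broken_text`/`text` pair. The helper name `make_pair` and the sample sentence are hypothetical, and the sketch uses only `re`: how the README uses `phonetics` and `random` lies outside this hunk and is not reproduced here. The patterns are written as raw strings without the backslashes before the umlauts, which should match the same characters while avoiding Python's invalid-escape warnings.

```python
import re

# Same character classes as in the hunk, written as raw strings.
chars_to_ignore_regex = r"[^A-Za-z0-9öäüÖÄÜß\-,;.:?! ]+"
broken_chars_to_ignore_regex = r"[^A-Za-z0-9öäüÖÄÜß\- ]+"


def make_pair(raw: str) -> dict:
    """Hypothetical helper: build one (broken_text, text) pair from a raw sentence."""
    # Output side: keep letters, digits, umlauts, and basic punctuation.
    text = re.sub(chars_to_ignore_regex, "", raw)
    # Input side: additionally strip the punctuation that `text` keeps.
    broken_text = re.sub(broken_chars_to_ignore_regex, "", text)
    return {"broken_text": broken_text, "text": text}


pair = make_pair("Hallo Welt! Wie geht's dir, Jürgen?")
print(pair)
```

In this sketch the only difference between the two fields is punctuation; any further corruption of `broken_text` would be layered on top.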