flozi00 committed on
Commit
eca2b01
1 Parent(s): 5278173

data example

Files changed (1)
  1. README.md +4 -0
README.md CHANGED
@@ -19,6 +19,10 @@ the data can be prepared like this:
  the broken_text is used as input, while the text is the output
  ```python

+ import re
+ import phonetics
+ import random
+
  chars_to_ignore_regex = "[^A-Za-z0-9\ö\ä\ü\Ö\Ä\Ü\ß\-,;.:?! ]+"
  broken_chars_to_ignore_regex = "[^A-Za-z0-9\ö\ä\ü\Ö\Ä\Ü\ß\- ]+"