Spaces: huggingface/text-data-filtering
6 contributors
History: 43 commits
Latest commit: HugoLaurencon, "better visualization" (8f0da78), almost 3 years ago
| File | Security scan | Size | Storage | Last commit message | Last updated |
|---|---|---|---|---|---|
| .gitattributes | Safe | 1.39 kB | | chinese visu | almost 3 years ago |
| .gitignore | Safe | 25 Bytes | | new tool to analyse our own doc | almost 3 years ago |
| LICENSE | Safe | 11.4 kB | | Create LICENSE | almost 3 years ago |
| README.md | Safe | 909 Bytes | | initial commit | almost 3 years ago |
| app.py | Safe | 23.1 kB | | better visualization | almost 3 years ago |
| badwords.py | Safe | 57.7 kB | | test | almost 3 years ago |
| en.arpa.bin | Safe | 4.4 GB | LFS | test | almost 3 years ago |
| en.sp.model | Safe | 1.39 MB | LFS | test | almost 3 years ago |
| en_examples_with_stats.json | Safe | 237 MB | LFS | filter on repetition removal | almost 3 years ago |
| explanation_filtering_pipeline.pdf | Safe | 216 kB | | filter on repetition removal | almost 3 years ago |
| filtering.py | Safe | 30.5 kB | | fix division by 0 in compute_special_characters_ratio | almost 3 years ago |
| languages_id.py | Safe | 5.48 kB | | test | almost 3 years ago |
| lid.176.bin | | 131 MB | LFS | test | almost 3 years ago |
| normalization.py | Safe | 941 Bytes | | test | almost 3 years ago |
| parameters_filtering.py | Safe | 31.4 kB | | new tool to analyse our own doc | almost 3 years ago |
| requirements.txt | Safe | 76 Bytes | | fix requirements | almost 3 years ago |
| stopwords.py | Safe | 99.2 kB | | test | almost 3 years ago |