Spaces: seanpedrickcase/document_redaction
main: document_redaction/tools
3 contributors
History: 156 commits
seanpedrickcase
Updated review functions to update with manual reviews. Minor package update
80268bb
9 days ago
File | Scan | Size | Last commit message | Updated
__init__.py | Safe | 0 Bytes | Initial commit | over 1 year ago
auth.py | Safe | 2.46 kB | Added compatibility with gradio_image_annotation for passing through id and text properties to annotator. Corrected csv location for Textract api calls. Other minor changes | 5 months ago
aws_functions.py | Safe | 9.51 kB | Updated duplicate pages functionality. Improve redaction efficiency a little with concat method. Minor modification to documentation and interface | 3 months ago
aws_textract.py | Safe | 18.3 kB | Added support for other languages. Improved DynamoDB download | 15 days ago
cli_redact.py | Safe | 9.08 kB | Updated command line redaction script with more options | 15 days ago
config.py | Safe | 22.5 kB | Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix | 14 days ago
custom_csvlogger.py | Safe | 12.9 kB | Updated packages. Corrected CSV logger headings, can now submit custom log csv names to S3. Started work on identifying and deduplicating at the line level | 2 months ago
custom_image_analyser_engine.py | Safe | 68.6 kB | Corrected some multiple xlsx/docx file redaction issues. package updates. | 14 days ago
data_anonymise.py | Safe | 43.1 kB | Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix | 14 days ago
example_cli_calls.txt | Safe | 769 Bytes | Repaired Dockerfile hopefully finally | 15 days ago
file_conversion.py | Safe | 109 kB | Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix | 14 days ago
file_redaction.py | Safe | 139 kB | Fixed minor issue of empty ocr_results_with_words object when doing Textract bulk calls | 14 days ago
find_duplicate_pages.py | Safe | 58.5 kB | Updated review functions to update with manual reviews. Minor package update | 9 days ago
helper_functions.py | Safe | 30.2 kB | Added support for other languages. Improved DynamoDB download | 15 days ago
load_spacy_model_custom_recognisers.py | Safe | 24.4 kB | Updated documentation. Fix on ocr_output upload before pdf. Duplicate page fix | 14 days ago
presidio_analyzer_custom.py | Safe | 4.92 kB | More config options. Fixed some bugs with removing elements from review page and Adobe export. Some UI rearrangements | 6 months ago
redaction_review.py | Safe | 96.1 kB | Updated review functions to update with manual reviews. Minor package update | 9 days ago
textract_batch_call.py | Safe | 28 kB | Expanded checks for out of range page cropboxes | 4 months ago