--- tags: - image-to-image license: apache-2.0 --- # About `sbb_binarization` This is a CNN model for document image binarization. It can be used to convert all pixels in a color or grayscale document image to only black or white pixels. The main aim is to improve the contrast between foreground (text) and background (paper) for purposes of OCR. The model is based on a `ResNet50-Unet` model. # Results In the *DocEng’2021 Time-Quality Binarization Competition* ([paper](https://dib.cin.ufpe.br/docs/DocEng21_bin_competition_report.pdf)), the model ranked 12 times under the top 8 of 63 methods, winning 2 tasks. In the *ICDAR 2021 Competition on Time-Quality Document Image Binarization* ([paper](https://dib.cin.ufpe.br/docs/papers/ICDAR2021-TQDIB_final_published.pdf)), the model ranked 2 times under the top 20 of 61 methods, winning 1 task. For details, see [sbb_binarization](https://github.com/qurator-spk/sbb_binarization) on GitHub. # Weights We provide a `saved model` for Tensorflow2. | Model | Downloads | -------------| ------------------------ | `2021_03_09` | [`saved_model`](https://huggingface.co/SBB/sbb_binarization/tree/main/saved_model)