---
tags:
- image-to-image
license: apache-2.0
---
# About `sbb_binarization`

This is a CNN model for document image binarization. It converts
every pixel of a color or grayscale document image to either black
or white. The main aim is to improve the contrast between
foreground (text) and background (paper) for OCR. The model is
based on a `ResNet50-Unet` architecture.
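
To make the target output concrete, here is a minimal sketch of what a binarized image looks like. It uses a fixed global threshold, which is a much simpler technique than this model and is shown only for illustration: the CNN decides per pixel from learned features instead. The function name `binarize`, the threshold of 128, and the toy image are assumptions made for this example.

```python
def binarize(gray, threshold=128):
    """Map each grayscale value (0-255) to 0 (black) or 255 (white).

    Illustrative global threshold only; NOT the method used by the model.
    """
    return [[255 if px >= threshold else 0 for px in row] for row in gray]


# Toy 2x3 grayscale "page": bright values are paper, dark values are ink.
page = [
    [250, 245, 90],
    [240, 30, 235],
]

binary = binarize(page)
```

Every pixel in `binary` is either 0 or 255, which is the same output format the model produces for a full document image.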

# Results
In the *DocEng’2021 Time-Quality Binarization Competition* 
([paper](https://dib.cin.ufpe.br/docs/DocEng21_bin_competition_report.pdf)), 
the model ranked in the top 8 of 63 methods 12 times and won 2 tasks.
 
In the *ICDAR 2021 Competition on Time-Quality Document Image 
Binarization* ([paper](https://dib.cin.ufpe.br/docs/papers/ICDAR2021-TQDIB_final_published.pdf)), 
the model ranked in the top 20 of 61 methods 2 times and won 1 task.

For details, see [sbb_binarization](https://github.com/qurator-spk/sbb_binarization) on GitHub.

# Weights
We provide the model in the `SavedModel` format for TensorFlow 2.

| Model        | Downloads |
| ------------ | --------- |
| `2021_03_09` | [`saved_model`](https://huggingface.co/SBB/sbb_binarization/tree/main/saved_model) |
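
One possible way to fetch the weights programmatically is sketched below. It assumes the `huggingface_hub` and `tensorflow` packages are installed, and that the `saved_model` folder linked above holds a standard TensorFlow SavedModel directory; neither assumption is confirmed by this card. The helper names `download_weights` and `load_model` are hypothetical.

```python
import os

REPO_ID = "SBB/sbb_binarization"  # repository from the download link above
MODEL_SUBDIR = "saved_model"      # folder from the download link above


def download_weights():
    """Download only the model folder and return its local path."""
    # Imported lazily so the module loads without the package installed.
    from huggingface_hub import snapshot_download

    local_repo = snapshot_download(
        repo_id=REPO_ID,
        allow_patterns=[f"{MODEL_SUBDIR}/*"],  # skip unrelated repo files
    )
    return os.path.join(local_repo, MODEL_SUBDIR)


def load_model(model_dir):
    """Load the directory as a SavedModel (assumed layout, see above)."""
    import tensorflow as tf

    return tf.saved_model.load(model_dir)
```

With both packages available, `load_model(download_weights())` would return the loaded object. For actual binarization of document images, the tooling in the GitHub repository linked above is the reference.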