---
language: ja
tags:
- image-to-text
license: apache-2.0
datasets:
- manga109s
---

# Manga OCR

Optical character recognition for Japanese text, with a primary focus on Japanese manga.

It uses the [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/vision-encoder-decoder) framework.
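
As a rough illustration of what using a Vision Encoder Decoder checkpoint looks like, the sketch below loads a model through `transformers` and decodes one image. It is not taken from the project's code. The repository id `kha-white/manga-ocr-base`, the `max_length` value, and the helper names `join_tokens` and `recognize` are assumptions made for this example. It also assumes the input is an image already cropped to a single text region, and that decoded tokens come back separated by spaces.

```python
# Assumption: Hub repository id for the checkpoint; it is not stated in this card.
MODEL_ID = "kha-white/manga-ocr-base"


def join_tokens(decoded: str) -> str:
    """Remove all whitespace from a decoded string (hypothetical helper).

    Assumes the tokenizer separates decoded tokens with spaces, which is
    not wanted in Japanese output.
    """
    return "".join(decoded.split())


def recognize(image_path: str, model_id: str = MODEL_ID) -> str:
    """Run OCR on one cropped text region and return the recognized text."""
    # Heavy dependencies are imported lazily so join_tokens works without them.
    # Assumption: the tokenizer may need extra packages for Japanese
    # (for example fugashi and unidic-lite) to be installed.
    from PIL import Image
    from transformers import (
        AutoImageProcessor,
        AutoTokenizer,
        VisionEncoderDecoderModel,
    )

    processor = AutoImageProcessor.from_pretrained(model_id)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)

    # The encoder expects a 3-channel image tensor.
    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(images=image, return_tensors="pt").pixel_values

    # Autoregressive decoding; max_length=300 is an assumed upper bound.
    token_ids = model.generate(pixel_values, max_length=300)
    decoded = tokenizer.decode(token_ids[0], skip_special_tokens=True)
    return join_tokens(decoded)
```

Calling `recognize("speech_bubble.png")` (a hypothetical file name) would download the checkpoint on first use and return the recognized string. Detecting text regions on a full page is outside the scope of this sketch.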

Manga OCR can be used as a general-purpose OCR for printed Japanese text, but its main goal is to provide
high-quality text recognition that is robust in scenarios specific to manga:
- both vertical and horizontal text
- text with furigana
- text overlaid on images
- wide variety of fonts and font styles
- low quality images

The code is available on [GitHub](https://github.com/kha-white/manga_ocr).