bropines
/

ballon-translator-models

Inference Endpoints

Model card Files Files and versions Community

ballon-translator-models / models /manga-ocr-base /README.md

bropines's picture

Upload 18 files

caea6ad over 1 year ago

|

history blame contribute delete

703 Bytes

	---
	language: ja
	tags:
	- image-to-text
	license: apache-2.0
	datasets:
	- manga109s
	---

	# Manga OCR

	Optical character recognition for Japanese text, with the main focus being Japanese manga.

	It uses [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/visionencoderdecoder) framework.

	Manga OCR can be used as a general purpose printed Japanese OCR, but its main goal was to provide a high quality
	text recognition, robust against various scenarios specific to manga:
	- both vertical and horizontal text
	- text with furigana
	- text overlaid on images
	- wide variety of fonts and font styles
	- low quality images

	Code is available [here](https://github.com/kha-white/manga_ocr).