Spaces:

breezedeus
/

Pix2Text-Demo

Running

App Files Files Community

Pix2Text-Demo / README.md

breezedeus

p2t v1.0

d917a85 3 months ago

preview code

raw history blame contribute delete

No virus

2.34 kB

	---
	title: Pix2Text
	emoji: ♾️
	colorFrom: red
	colorTo: blue
	sdk: gradio
	sdk_version: 4.19.2
	app_file: app.py
	pinned: false
	license: mit
	---

	# Pix2Text (P2T)

	[Pix2Text (P2T)](https://github.com/breezedeus/pix2text) aims to be a free and open-source Python alternative to [Mathpix](https://mathpix.com/). It can already complete the core functionalities of Mathpix. Starting from V0.2, Pix2Text (P2T) supports recognizing mixed images containing both text and formulas, with output similar to Mathpix. The core principles of P2T are shown below (text recognition supports both Chinese and English):

	<div align="center"> <img src="https://www.notion.so/image/https%3A%2F%2Fs3-us-west-2.amazonaws.com%2Fsecure.notion-static.com%2F8afb65f8-fd1d-48b9-978a-688554cc759a%2FUntitled.jpeg?table=block&id=39580ae6-09e5-4631-a611-e80e720f3877" alt="Pix2Text workflow" width="600px"/> </div>

	P2T utilizes the open-source tool [CnSTD](https://github.com/breezedeus/cnstd) to detect the locations of mathematical formulas in images. These detected areas are then processed by P2T's own formula recognition engine (LatexOCR) to recognize the LaTeX representation of each mathematical formula. The remaining parts of the image are processed by a text recognition engine ([CnOCR](https://github.com/breezedeus/cnocr) or [EasyOCR](https://github.com/JaidedAI/EasyOCR)) for text detection and recognition. Finally, P2T merges all recognition results to obtain the final image recognition outcome. Thanks to these great open-source projects!

	For beginners who are not familiar with Python, we also provide the free-to-use [P2T Online Service](https://p2t.breezedeus.com/). Just upload your image and it will output the P2T parsing results. The online service uses the latest models and works better than the open-source ones.


	The author also maintains Planet of Knowledge [P2T/CnOCR/CnSTD Private Group](https://t.zsxq.com/FEYZRJQ), welcome to join. The Planet of Knowledge Private Group will release some P2T/CnOCR/CnSTD related private materials one after another, including non-public models, discount for paid models, answers to problems encountered during usage, etc. This group also releases the latest research materials related to VIE/OCR/STD.