microsoft/trocr-base-printed · can I use this model to extract text from an entire document?

Feb 25, 2023

Hey there, I am working on a PDF parsing project.

Is there a way to use this model to extract an entire page?

OR - are there any other models capable of extracting text from images like these? (don't mind the red rectangle)
I tried other python libraries and the results are bad

P.S. yes, I am using another model to detect tables and remove them in order to improve the parsing

P.P.S. yes, the image above is taken from "attention is all you need" lol

wanbiguizhao

Apr 26, 2023

Hey there, I am working on a PDF parsing project.

Is there a way to use this model to extract an entire page?

OR - are there any other models capable of extracting text from images like these? (don't mind the red rectangle)
I tried other python libraries and the results are bad

P.S. yes, I am using another model to detect tables and remove them in order to improve the parsing

P.P.S. yes, the image above is taken from "attention is all you need" lol
maybe you can try layoutlmv3 ,which can analysis document layout,help detect table ,title,text,etc

wanbiguizhao

Apr 26, 2023

https://arxiv.org/pdf/2204.08387.pdf paper

ldemiguel

May 30, 2023

At the end did you find an answer for extract an entire page?

wanbiguizhao

Jun 2, 2023

At the end did you find an answer for extract an entire page?

i'm trying