Using this model as a QA tool/OCR on a text-heavy document?

#24

by Techie5879 - opened Oct 23, 2023

Can this model be used accurately as a QA tool or for OCR text extraction on a text-heavy document? Current OCR solutions struggle with multi-column formats. We want to extract text efficiently from such documents and put it into a structured format (JSON). Can this model be used for this? And if so, does it have to be fine-tuned, or can we try out-of-the-box inference by changing the test image and the prompt and expect results?

Molbap

Oct 24, 2023

I don't know if the model was trained/tested on text-heavy documents @Techie5879 - as a matter of fact I'll evaluate it on a document QA task soon because I'm curious about it as well

rfhuang

Nov 8, 2023

Working on the problem myself. In the meantime, any updates @Molbap?
