Sharka
/

CIVQA_LayoutXLM

Document Question Answering

document question answering

Model card Files Files and versions Community

LayoutXLM Model Fine-tuned with CIVQA (Tesseract) dataset

This is a fine-tuned version of the LayoutXLM model, which was trained on Czech Invoice Visual Question Answering (CIVQA) datasets containing invoices in the Czech language.

This model enables Document Visual Question Answering on Czech invoices.

All invoices used in this dataset were obtained from public sources. Over these invoices, we were focusing on 15 different entities, which are crucial for processing the invoices.

Invoice number
Variable symbol
Specific symbol
Constant symbol
Bank code
Account number
ICO
Total amount
Invoice date
Due date
Name of supplier
IBAN
DIC
QR code
Supplier's address

You can find more information about this model in this paper.

Downloads last month: 0

Inference Providers NEW

Document Question Answering

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train Sharka/CIVQA_LayoutXLM