--- language: en license: cc-by-nc-sa-4.0 tags: - layoutlm - document-question-answering - pdf - invoices --- # LayoutLM for Invoices This is a fine-tuned version of the multi-modal [LayoutLM](https://aka.ms/layoutlm) model for the task of question answering on invoices and other documents. It has been fine-tuned on a proprietary dataset of invoices as well as both [SQuAD2.0](https://huggingface.co/datasets/squad_v2) and [DocVQA](https://www.docvqa.org/) for general comprehension. ## Non-consecutive tokens Unlike other QA models, which can only extract consecutive tokens (because they predict the start and end of a sequence), this model can predict longer-range, non-consecutive sequences with an additional classifier head. For example, QA models often encounter this failure mode: ![Broken Address](./before.png) However this model is able to predict non-consecutive tokens and therefore the address correctly: ![Two-line Address](./after.png) ## Getting started with the model The best way to use this model is via [DocQuery](https://github.com/impira/docquery). ## About us This model was created by the team at [Impira](https://www.impira.com/).