---
license: mit
language:
- cs
tags:
- Document Question Answering
---
# LayoutLMv3 Model Fine-tuned with CIVQA (EasyOCR) dataset

This model is a fine-tuned version of the [LayoutLMv3 model](https://huggingface.co/microsoft/layoutlmv3-base), trained on the Czech Invoice Visual Question Answering (CIVQA) dataset, which contains invoices in the Czech language.

This model enables Document Visual Question Answering on Czech invoices.
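
LayoutLM-family models expect each word's bounding box as `[x0, y0, x1, y1]` scaled to the 0-1000 range, while EasyOCR reports each detection as four corner points in pixels. The following is a minimal sketch of that conversion, not the preprocessing actually used to build CIVQA. The helper name `quad_to_layoutlm_box`, the sample OCR result, and the page size are all hypothetical.

```python
from typing import List, Sequence


def quad_to_layoutlm_box(
    quad: Sequence[Sequence[float]], image_width: int, image_height: int
) -> List[int]:
    """Convert a four-corner pixel quadrilateral to a 0-1000 [x0, y0, x1, y1] box.

    Hypothetical helper: it takes the axis-aligned bounding rectangle of the
    quadrilateral and rescales it relative to the page size.
    """
    xs = [point[0] for point in quad]
    ys = [point[1] for point in quad]

    def scale(value: float, size: int) -> int:
        # Rescale to 0-1000 and clamp, in case OCR reports points off the page.
        return max(0, min(1000, int(1000 * value / size)))

    return [
        scale(min(xs), image_width),
        scale(min(ys), image_height),
        scale(max(xs), image_width),
        scale(max(ys), image_height),
    ]


# Hypothetical OCR output in EasyOCR's (corner points, text, confidence) shape.
ocr_result = [([[100, 50], [300, 50], [300, 90], [100, 90]], "Faktura", 0.98)]

# Assumed page size in pixels: 1000 wide, 2000 high.
words = [text for _quad, text, _conf in ocr_result]
boxes = [quad_to_layoutlm_box(quad, 1000, 2000) for quad, _text, _conf in ocr_result]
```

The resulting `words` and `boxes` lists are the kind of input a LayoutLMv3 processor accepts when its built-in OCR is disabled.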

All invoices in this dataset were obtained from public sources. We focused on 15 entities that are crucial for invoice processing:
- Invoice number
- Variable symbol
- Specific symbol
- Constant symbol
- Bank code
- Account number
- ICO
- Total amount
- Invoice date
- Due date
- Name of supplier
- IBAN
- DIC
- QR code
- Supplier's address
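
When querying the model, the entities listed above can be turned into one question per field. The sketch below is only an illustration: the English question wording and the names `ENTITY_QUESTIONS` and `build_questions` are assumptions, and the questions actually used in CIVQA may be phrased differently (for example, in Czech).

```python
from typing import Dict, Iterable, List, Optional

# The 15 entities from the list above, each paired with a hypothetical question.
ENTITY_QUESTIONS: Dict[str, str] = {
    "Invoice number": "What is the invoice number?",
    "Variable symbol": "What is the variable symbol?",
    "Specific symbol": "What is the specific symbol?",
    "Constant symbol": "What is the constant symbol?",
    "Bank code": "What is the bank code?",
    "Account number": "What is the account number?",
    "ICO": "What is the ICO?",
    "Total amount": "What is the total amount?",
    "Invoice date": "What is the invoice date?",
    "Due date": "What is the due date?",
    "Name of supplier": "What is the name of the supplier?",
    "IBAN": "What is the IBAN?",
    "DIC": "What is the DIC?",
    "QR code": "What is the QR code?",
    "Supplier's address": "What is the supplier's address?",
}


def build_questions(entities: Optional[Iterable[str]] = None) -> List[str]:
    """Return the questions for the requested entities, or for all 15 by default."""
    selected = ENTITY_QUESTIONS.keys() if entities is None else entities
    return [ENTITY_QUESTIONS[name] for name in selected]


# Example: ask only about the payment-related fields.
payment_questions = build_questions(["Total amount", "Due date", "IBAN"])
```

Each question would then be passed to the model together with the invoice image and its OCR words and boxes.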

More information about the dataset is available in this [paper](https://nlp.fi.muni.cz/raslan/raslan23.pdf#page=31).