scharnot committed on
Commit
1894b4e
1 Parent(s): b29d425

Create README.md

Files changed (1)
  1. README.md +16 -0
README.md ADDED
@@ -0,0 +1,16 @@
+ ---
+ tags:
+ - donut
+ - image-to-text
+ - invoices
+ ---
+ # Overview
+ This repository contains a version of the Donut model fine-tuned for document understanding on invoices. Donut (Document Understanding Transformer) is an OCR-free model introduced in the paper "OCR-free Document Understanding Transformer" by Geewook Kim et al. and first released in the repository https://github.com/clovaai/donut.
+
+ The goal of this fine-tuning is to improve Donut's performance on invoice analysis and field extraction. The model was trained on a custom dataset of several hundred annotated invoices. The dataset is not included in this repository; details on its availability will be provided later.
+
+ # Model Details
+
+ Donut is a transformer-based encoder-decoder model: a Swin Transformer encoder reads the document image, and a BART-style text decoder generates the structured output directly, without a separate OCR step. Fine-tuning on a custom invoice dataset is intended to improve how accurately the model extracts fields such as vendor details, billing information, line items, and totals.
+
+ [Demo can be found here](https://colab.research.google.com/drive/1zDvSysp24bCk60LR6172Z94eY1mRhKWF#scrollTo=f7RoSOEXUa6i)
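
As a rough illustration of the extraction described under Model Details, the sketch below shows how a fine-tuned Donut checkpoint is typically run with the Hugging Face `transformers` library. It is not taken from this repository or from the linked demo. The model ID `your-namespace/donut-invoices` and the task start token `<s_invoice>` are hypothetical placeholders, because the README states neither; replace them with the real Hub ID and the prompt token used during fine-tuning. The code assumes `transformers`, `torch`, and `Pillow` are installed.

```python
import re

# Hypothetical placeholders: the README names neither the Hub ID nor the task
# start token, so substitute the values that match the actual checkpoint.
MODEL_ID = "your-namespace/donut-invoices"
TASK_PROMPT = "<s_invoice>"


def clean_sequence(sequence: str, eos_token: str, pad_token: str) -> str:
    """Strip EOS/PAD tokens and the leading task start token from decoded output."""
    sequence = sequence.replace(eos_token, "").replace(pad_token, "")
    # Remove only the first tag, which is the task start token.
    return re.sub(r"<.*?>", "", sequence, count=1).strip()


def extract_invoice_fields(
    image_path: str,
    model_id: str = MODEL_ID,
    task_prompt: str = TASK_PROMPT,
) -> dict:
    """Run Donut on one invoice image and return the decoded fields as a dict."""
    # Heavy dependencies are imported lazily so the helper above stays lightweight.
    import torch
    from PIL import Image
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values
    decoder_input_ids = processor.tokenizer(
        task_prompt, add_special_tokens=False, return_tensors="pt"
    ).input_ids

    with torch.no_grad():
        outputs = model.generate(
            pixel_values,
            decoder_input_ids=decoder_input_ids,
            max_length=model.decoder.config.max_position_embeddings,
            pad_token_id=processor.tokenizer.pad_token_id,
            eos_token_id=processor.tokenizer.eos_token_id,
            bad_words_ids=[[processor.tokenizer.unk_token_id]],
            return_dict_in_generate=True,
        )

    sequence = processor.batch_decode(outputs.sequences)[0]
    sequence = clean_sequence(
        sequence, processor.tokenizer.eos_token, processor.tokenizer.pad_token
    )
    # token2json converts Donut's tag-style output into nested Python data.
    return processor.token2json(sequence)
```

Calling `extract_invoice_fields("invoice.png")` would return a dictionary whose keys depend entirely on the annotation schema used for fine-tuning, so no particular field names should be assumed.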