File size: 930 Bytes
202791c
 
 
 
0677c8c
 
 
2243313
 
c3ad48e
 
995fb0e
c3ad48e
80d909d
 
c3ad48e
3d1e813
2a5ee68
 
c3ad48e
 
 
 
98dce50
27fce77
80d909d
 
98dce50
27fce77
98dce50
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
---
license: mit
language:
- en
pipeline_tag: image-to-text
datasets:
- katanaml-org/invoices-donut-data-v1
---

## Sparrow - Data extraction from documents with ML

This model is finetuned Donut ML base model on invoices data. Model aims to verify how well Donut performs on enterprise docs.

Mean accuracy on test set: 0.96

Inference:

![Inference Results](https://raw.githubusercontent.com/katanaml/sparrow/main/sparrow-ui/assets/inference_actual.png)

Training loss:

![Training Loss](https://raw.githubusercontent.com/katanaml/sparrow/main/sparrow-ui/assets/donut_training_loss.png)

Sparrow on [GitHub](https://github.com/katanaml/sparrow)

Sample invoice [docs](https://github.com/katanaml/sparrow/tree/main/sparrow-ui/docs/images) to use for inference (docs up to 500 were used for fine-tuning, use docs from 500 for inference)

Our website [KatanaML](https://www.katanaml.io)

On [Twitter](https://twitter.com/katana_ml)