---
license: mit
language:
- en
pipeline_tag: image-to-text
datasets:
- katanaml-org/invoices-donut-data-v1
---
## Sparrow - Data extraction from documents with ML
This model is the Donut base model fine-tuned on invoice data. It aims to verify how well Donut performs on enterprise documents.
Mean accuracy on test set: 0.96
Inference:
![Inference Results](https://raw.githubusercontent.com/katanaml/sparrow/main/sparrow-ui/assets/inference_actual.png)
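
For readers who want to reproduce a result like the one above, here is a minimal sketch of Donut inference with the Hugging Face `transformers` library. It is not taken from the Sparrow code base. The function names `strip_task_prefix` and `extract_invoice` are hypothetical, `model_id` must be set to this model's Hub id (not stated in this card), and the default task prompt `<s_cord-v2>` is an assumption that should be checked against the tokenizer used during fine-tuning.

```python
import re


def strip_task_prefix(sequence, eos_token, pad_token):
    """Remove EOS/PAD tokens and the leading task-prompt tag from decoded output."""
    sequence = sequence.replace(eos_token, "").replace(pad_token, "")
    # Drop only the first <...> tag, which is the task start token.
    return re.sub(r"<.*?>", "", sequence, count=1).strip()


def extract_invoice(image_path, model_id, task_prompt="<s_cord-v2>"):
    """Run one invoice image through a Donut checkpoint and return parsed fields.

    Assumptions: `model_id` is this model's Hub id (supplied by the caller) and
    `task_prompt` matches the prompt token used for fine-tuning.
    """
    # Heavy dependencies are imported lazily so the helper above stays lightweight.
    import torch
    from PIL import Image
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values
    decoder_input_ids = processor.tokenizer(
        task_prompt, add_special_tokens=False, return_tensors="pt"
    ).input_ids

    with torch.no_grad():
        outputs = model.generate(
            pixel_values,
            decoder_input_ids=decoder_input_ids,
            max_length=model.decoder.config.max_position_embeddings,
            pad_token_id=processor.tokenizer.pad_token_id,
            eos_token_id=processor.tokenizer.eos_token_id,
            use_cache=True,
            bad_words_ids=[[processor.tokenizer.unk_token_id]],
            return_dict_in_generate=True,
        )

    sequence = processor.batch_decode(outputs.sequences)[0]
    sequence = strip_task_prefix(
        sequence, processor.tokenizer.eos_token, processor.tokenizer.pad_token
    )
    # token2json converts Donut's tag sequence into a nested dict of fields.
    return processor.token2json(sequence)
```

Calling `extract_invoice("path/to/invoice.jpg", model_id=...)` should return a dictionary of extracted fields. The exact keys depend on the annotation schema of the fine-tuning dataset.
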
Training loss:
![Training Loss](https://raw.githubusercontent.com/katanaml/sparrow/main/sparrow-ui/assets/donut_training_loss.png)
Sparrow on [GitHub](https://github.com/katanaml/sparrow)
Sample invoice [docs](https://github.com/katanaml/sparrow/tree/main/sparrow-ui/docs/images) to use for inference. Documents up to 500 were used for fine-tuning, so use documents from 500 onward for inference.
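
To avoid evaluating on documents the model has already seen, the held-out files can be selected by their index. The sketch below assumes the sample files carry a numeric index just before the extension (a hypothetical pattern such as `invoice_500.jpg`) and that index 500 itself belongs to the held-out set. Both are assumptions, so verify them against the actual folder contents.

```python
import re
from pathlib import Path

HOLDOUT_START = 500  # assumption: indices below this were used for fine-tuning


def is_holdout(name, start=HOLDOUT_START):
    """Return True if the file name ends in a numeric index >= start."""
    # Capture the digits that sit directly before the final file extension.
    match = re.search(r"(\d+)(?=\.[^.]+$)", name)
    return bool(match) and int(match.group(1)) >= start


def holdout_docs(folder, start=HOLDOUT_START):
    """List files in `folder` whose index marks them as unseen during fine-tuning."""
    return sorted(
        p for p in Path(folder).iterdir() if p.is_file() and is_holdout(p.name, start)
    )
```
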
Our website [KatanaML](https://www.katanaml.io)
On [Twitter](https://twitter.com/katana_ml)