Sparrow - Data extraction from documents with ML

This model is the Donut ML base model fine-tuned on invoice data. It was built to test how well Donut performs on enterprise documents.

Mean accuracy on test set: 0.96

Inference: [figure not included]

Training loss: [figure not included]

Sparrow on GitHub

Sample invoice docs for inference: docs up to 500 were used for fine-tuning, so use docs from 500 onward for inference.
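The split described above, followed by a single inference call, could be sketched roughly as follows. This is a minimal sketch under stated assumptions, not the author's script. The repository id `ssraut/Extract_Matic` is taken from this page. The task prompt `<s_cord-v2>`, the file naming scheme (a document number embedded in the file name), and the helper names `doc_number`, `select_inference_docs`, and `extract_fields` are assumptions made for illustration. The sketch also reads "from 500" as including document 500. The inference function needs `torch`, `Pillow`, and `transformers` installed, and it downloads the model on first use.

```python
import re
from pathlib import Path

MODEL_ID = "ssraut/Extract_Matic"   # repository id as it appears on this page
TASK_PROMPT = "<s_cord-v2>"         # ASSUMPTION: prompt token used at fine-tuning time
FIRST_INFERENCE_DOC = 500           # ASSUMPTION: doc 500 itself counts as held out


def doc_number(path):
    """Return the last run of digits in the file stem, or None if there is none."""
    digits = re.findall(r"\d+", Path(path).stem)
    return int(digits[-1]) if digits else None


def select_inference_docs(paths, first=FIRST_INFERENCE_DOC):
    """Keep only documents numbered `first` or higher, sorted by number."""
    numbered = [(doc_number(p), str(p)) for p in paths]
    held_out = [(n, p) for n, p in numbered if n is not None and n >= first]
    return [p for _, p in sorted(held_out)]


def extract_fields(image_path, model_id=MODEL_ID, task_prompt=TASK_PROMPT):
    """Run the Donut checkpoint on one invoice image and return the parsed fields."""
    # Heavy dependencies are imported lazily so the helpers above work without them.
    import torch
    from PIL import Image
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(model_id)
    model = VisionEncoderDecoderModel.from_pretrained(model_id)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values
    prompt_ids = processor.tokenizer(
        task_prompt, add_special_tokens=False, return_tensors="pt"
    ).input_ids

    with torch.no_grad():
        output_ids = model.generate(
            pixel_values,
            decoder_input_ids=prompt_ids,
            max_length=model.decoder.config.max_position_embeddings,
            pad_token_id=processor.tokenizer.pad_token_id,
            eos_token_id=processor.tokenizer.eos_token_id,
            bad_words_ids=[[processor.tokenizer.unk_token_id]],
        )

    text = processor.batch_decode(output_ids)[0]
    text = text.replace(processor.tokenizer.eos_token, "")
    text = text.replace(processor.tokenizer.pad_token, "")
    text = re.sub(r"<.*?>", "", text, count=1).strip()  # drop the leading task prompt
    return processor.token2json(text)
```

With a hypothetical folder of sample invoices, `select_inference_docs(Path("docs").glob("*.jpg"))` would list the held-out files, and each one can then be passed to `extract_fields`. If the checkpoint was trained with a different prompt token, `task_prompt` needs to be changed accordingly.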

Our website KatanaML

On Twitter


ssraut/Extract_Matic