--- license: mit datasets: - felipebandeira/driverlicenses2k language: - en metrics: - accuracy pipeline_tag: image-to-text --- This model extracts information from EU driver's licenses and returns it as JSON. For optimal performance, we recommend that input images: - have a size of 1192x772 - have high resolution and do not contain light reflection effects Accuracy - on validation set: 98% - on set of real licenses: 63.93% Article describing model: https://medium.com/@ofelipebandeira/transformers-vs-ocr-who-can-read-better-192e6b044dd3 Article describing synthetic dataset used in training: https://python.plainenglish.io/how-to-create-synthetic-datasets-of-document-images-5f140dee5e40