File size: 1,059 Bytes
d13bba3
9343926
d13bba3
9343926
 
 
 
 
 
 
 
 
d13bba3
9343926
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5254ab2
 
 
9343926
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
---
library_name: PyLaia
license: mit
tags:
- PyLaia
- PyTorch
- Handwritten text recognition
metrics:
- CER
- WER
language:
- en
---

# English printed text recognition

This model performs Handwritten Text Recognition in English. 

## Model description

The model has been trained using the PyLaia library on the [IAM](https://fki.tic.heia-fr.ch/databases/iam-handwriting-database) dataset.
Training images were resized with a fixed height of 128 pixels, keeping the original aspect ratio.


## Evaluation results

The model achieves the following results:

| Split | CER (%) | WER (%) | Support |
| ----- | ------- | ------- | ------- |
| train | 0.32    | 1.26    | 6482    |
| val   | 6.50    | 19.12   | 1926    |
| test  | 7.68    | 19.82   | 1965    |

These results were published [Key-value information extraction from full
handwritten pages](https://arxiv.org/pdf/2304.13530.pdf).

Results can be improved by combining PyLaia with a n-gram language model. 

## How to use

Please refer to the [documentation](https://atr.pages.teklia.com/pylaia/).