Upload latest model

Files changed (6) hide show

README.md CHANGED Viewed

@@ -10,32 +10,42 @@ metrics:
 - WER
 language:
 - en
 ---
-# English handwritten text recognition
-This model performs Handwritten Text Recognition in English.
 ## Model description
-The model has been trained using the PyLaia library on the [IAM](https://fki.tic.heia-fr.ch/databases/iam-handwriting-database) dataset.
-Training images were resized with a fixed height of 128 pixels, keeping the original aspect ratio.
 ## Evaluation results
 The model achieves the following results:
-| Split | CER (%) | WER (%) | Support |
-| ----- | ------- | ------- | ------- |
-| train | 0.32    | 1.26    | 6482    |
-| val   | 6.50    | 19.12   | 1926    |
-| test  | 7.68    | 19.82   | 1965    |
-A similar model was trained on the RWTH split, corresponding to the results published in [Key-value information extraction from full handwritten pages](https://arxiv.org/pdf/2304.13530.pdf).
-Results can be improved by combining PyLaia with a n-gram language model.
 ## How to use
 Please refer to the [documentation](https://atr.pages.teklia.com/pylaia/).

 - WER
 language:
 - en
+datasets:
+- Teklia/IAM
 ---
+# IAM handwritten text recognition
+This model performs Handwritten Text Recognition in English on modern documents.
 ## Model description
+The model was trained using the PyLaia library on the [IAM database](https://fki.tic.heia-fr.ch/databases/iam-handwriting-database).
+For training, text-lines were resized with a fixed height of 128 pixels, keeping the original aspect ratio.
+An external 6-gram character language model can be used to improve recognition. The language model is trained on the text from the IAM training set.
 ## Evaluation results
 The model achieves the following results:
+| set   | Language model | CER (%)    | WER (%) | N lines   |
+|:------|:---------------|:----------:|:-------:|----------:|
+| test  | no             | 8.44       | 24.51   |      2915 |
+| test  | yes            | 7.50       | 20.98   |      2915 |
 ## How to use
 Please refer to the [documentation](https://atr.pages.teklia.com/pylaia/).
+## Cite us
+```bibtex
+@inproceedings{pylaia-lib,
+    author = "Tarride, Solène and Schneider, Yoann and Generali, Marie and Boillet, Melodie and Abadie, Bastien and Kermorvant, Christopher",
+    title = "Improving Automatic Text Recognition with Language Models in the PyLaia Open-Source Library",
+    booktitle = "Submitted at ICDAR2024",
+    year = "2024"
+}
+```

language_model.arpa.gz ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:a2ea36e75faa0d9f3e4def71e674cbbbe3d52bc7056d20218372b50fbf999ad6
+size 5355981

lexicon.txt ADDED Viewed

+▁ <space>
+! !
+" "
+# #
+& &
+' '
+( (
+) )
+* *
++ +
+, ,
+- -
+. .
+/ /
+0 0
+1 1
+2 2
+3 3
+4 4
+5 5
+6 6
+7 7
+8 8
+9 9
+: :
+; ;
+? ?
+A A
+B B
+C C
+D D
+E E
+F F
+G G
+H H
+I I
+J J
+K K
+L L
+M M
+N N
+O O
+P P
+Q Q
+R R
+S S
+T T
+U U
+V V
+W W
+X X
+Y Y
+Z Z
+a a
+b b
+c c
+d d
+e e
+f f
+g g
+h h
+i i
+j j
+k k
+l l
+m m
+n n
+o o
+p p
+q q
+r r
+s s
+t t
+u u
+v v
+w w
+x x
+y y
+z z
+◌ <ctc>

model CHANGED Viewed

Binary files a/model and b/model differ

tokens.txt ADDED Viewed

+<ctc>
+!
+"
+#
+&
+'
+(
+)
+*
++
+,
+-
+.
+/
+0
+1
+2
+3
+4
+5
+6
+7
+8
+9
+:
+;
+?
+A
+B
+C
+D
+E
+F
+G
+H
+I
+J
+K
+L
+M
+N
+O
+P
+Q
+R
+S
+T
+U
+V
+W
+X
+Y
+Z
+a
+b
+c
+d
+e
+f
+g
+h
+i
+j
+k
+l
+m
+n
+o
+p
+q
+r
+s
+t
+u
+v
+w
+x
+y
+z
+<unk>
+<space>

weights.ckpt CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6c5cd4f1157c2b7768fdef5eb0f3264270b477d111f964c7e78a1b18783d09ed
-size 42673218

 version https://git-lfs.github.com/spec/v1
+oid sha256:9b9541eb80007bc817bbe5b91828f3dc3ddc7e461d3480bf14cc6931458474b2
+size 42671836