Gabriel commited on
Commit
181f098
1 Parent(s): 440664a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -8
README.md CHANGED
@@ -6,25 +6,26 @@ pipeline_tag: image-to-text
6
  tags:
7
  - HTR
8
  ---
 
9
  # Model Card: Swedish National Archives HTR Pipeline
10
 
11
  ## Model Description
12
- The Swedish National Archives presents an end-to-end Handwritten Text Recognition (HTR) pipeline for running-text documents ranging from 1600 to 1900. The pipeline consists of the following components:
13
 
14
  1. **RTMDet Instance Segmentation Models**: The pipeline utilizes two RTMDet instance segmentation models, trained using MMDetection. The first model is designed to segment text regions within the documents, while the second model focuses on segmenting text lines within these regions. These models enable the identification and localization of text areas, which is a crucial step in the HTR pipeline.
15
 
16
  2. **SATRN HTR Model**: The pipeline incorporates a SATRN (Spatial Attention Transformer Networks) model, trained using MMOCR (OpenMMLab's OCR toolbox). SATRN is a state-of-the-art model for HTR tasks and provides accurate recognition of handwritten text. The SATRN model is trained specifically to handle the characteristics and challenges of handwritten text present in the Swedish National Archives' documents.
17
 
18
- The models are designed to provide a generic pipeline for handwritten text recognition, offering robust performance for documents from the years 1600 to 1900.
19
 
20
  ## Intended Use
21
  The Swedish National Archives HTR pipeline is intended to be used for the following purposes:
22
 
23
- - Handwritten Text Recognition: The pipeline enables the automatic recognition of handwritten text in running-text documents from the 17th to the 19th century. It can be utilized by researchers, historians, and archivists to efficiently transcribe and analyze historical texts.
24
 
25
  - Document Digitization: The pipeline aids in the process of digitizing archival documents by automating the extraction and transcription of handwritten text. This facilitates broader accessibility and preservation of historical materials.
26
 
27
- It's important to note that the pipeline is optimized for running-text documents from the specified time period and may not perform optimally for other types of documents or handwriting styles.
28
 
29
  ## Performance and Limitations
30
  The performance of the Swedish National Archives HTR pipeline is influenced by several factors:
@@ -33,14 +34,14 @@ The performance of the Swedish National Archives HTR pipeline is influenced by s
33
 
34
  - **Speed**: The pipeline aims to provide real-time or near real-time performance for efficient processing of handwritten text documents. The speed may vary depending on the hardware used for inference.
35
 
36
- - **Document Specificity**: The pipeline is specifically trained for running-text documents from the 17th to the 19th century. It may not perform optimally for documents outside this time range or for documents with unique characteristics or handwriting styles not covered by the training data.
37
 
38
  - **Language Limitations**: The pipeline is tailored for Swedish text recognition. While it may handle other languages to some extent, its performance may not be as accurate as for Swedish.
39
 
40
  - **Handwriting Style**: The pipeline is optimized for the cursive handwriting style prevalent in the historical documents of the Swedish National Archives. It may not perform as well for other handwriting styles, such as block letters or highly stylized scripts.
41
 
42
  ## Training Data
43
- The Swedish National Archives HTR pipeline was trained using a diverse dataset of running-text documents from the 17th to the 19th century. The training data includes various types of historical texts, such as letters, manuscripts, and official records.
44
 
45
  The dataset comprises both high-quality and challenging examples to ensure the models' robustness. It covers a wide range of handwriting styles, legibility levels, and document conditions.
46
 
@@ -65,10 +66,13 @@ Although the Swedish National Archives HTR pipeline has been trained and optimiz
65
  - **User Feedback**: Users are encouraged to provide feedback on the pipeline's performance, identify issues, and report any potential biases or limitations. This feedback can contribute to refining the pipeline and addressing concerns.
66
 
67
  ## Conclusion
68
- The Swedish National Archives HTR pipeline offers a powerful solution for automatic handwritten text recognition in running-text documents from the 17th to the 19th century. It leverages instance segmentation models for text region and line extraction, combined with a state-of-the-art SATRN model for accurate recognition. While the pipeline demonstrates high accuracy, speed, and flexibility, users should be aware of its limitations and consider the ethical implications of using historical data. Regular updates and user feedback are crucial for maintaining and improving the pipeline's performance.
69
 
70
  For further inquiries or assistance, please reach out to the Swedish National Archives.
71
 
 
 
 
72
  ## References
73
  If you would like to learn more about the Swedish National Archives HTR pipeline or access the training data, please refer to the following resources:
74
 
@@ -76,4 +80,4 @@ If you would like to learn more about the Swedish National Archives HTR pipeline
76
  - [MMDetection](https://github.com/open-mmlab/mmdetection)
77
  - [MMOCR](https://github.com/open-mmlab/mmocr)
78
  - [SATRN Paper](https://arxiv.org/abs/2012.05483)
79
- - [OpenMMLab OCR Toolbox](https://openmmlab.com/mmocr/)
 
6
  tags:
7
  - HTR
8
  ---
9
+
10
  # Model Card: Swedish National Archives HTR Pipeline
11
 
12
  ## Model Description
13
+ The Swedish National Archives presents an end-to-end Handwritten Text Recognition (HTR) pipeline for running-text documents ranging from the 16th to the 19th century. The pipeline consists of the following components:
14
 
15
  1. **RTMDet Instance Segmentation Models**: The pipeline utilizes two RTMDet instance segmentation models, trained using MMDetection. The first model is designed to segment text regions within the documents, while the second model focuses on segmenting text lines within these regions. These models enable the identification and localization of text areas, which is a crucial step in the HTR pipeline.
16
 
17
  2. **SATRN HTR Model**: The pipeline incorporates a SATRN (Spatial Attention Transformer Networks) model, trained using MMOCR (OpenMMLab's OCR toolbox). SATRN is a state-of-the-art model for HTR tasks and provides accurate recognition of handwritten text. The SATRN model is trained specifically to handle the characteristics and challenges of handwritten text present in the Swedish National Archives' documents.
18
 
19
+ The models are designed to provide a generic pipeline for handwritten text recognition, offering robust performance for documents from the 16th to the 19th century.
20
 
21
  ## Intended Use
22
  The Swedish National Archives HTR pipeline is intended to be used for the following purposes:
23
 
24
+ - Handwritten Text Recognition: The pipeline enables the automatic recognition of handwritten text in running-text documents from the 16th to the 19th century. It can be utilized by researchers, historians, and archivists to efficiently transcribe and analyze historical texts.
25
 
26
  - Document Digitization: The pipeline aids in the process of digitizing archival documents by automating the extraction and transcription of handwritten text. This facilitates broader accessibility and preservation of historical materials.
27
 
28
+ It's important to note that the pipeline is optimized for running-text documents from the specified time period and may not perform optimally for other types of documents or handwriting styles. Additionally, it is currently more suitable for documents from books rather than complex layouts or tabular data found in newspapers.
29
 
30
  ## Performance and Limitations
31
  The performance of the Swedish National Archives HTR pipeline is influenced by several factors:
 
34
 
35
  - **Speed**: The pipeline aims to provide real-time or near real-time performance for efficient processing of handwritten text documents. The speed may vary depending on the hardware used for inference.
36
 
37
+ - **Document Specificity**: The pipeline is specifically trained for running-text documents from the 16th to the 19th century. It may not perform optimally for documents outside this time range or for documents with unique characteristics or handwriting styles not covered by the training data.
38
 
39
  - **Language Limitations**: The pipeline is tailored for Swedish text recognition. While it may handle other languages to some extent, its performance may not be as accurate as for Swedish.
40
 
41
  - **Handwriting Style**: The pipeline is optimized for the cursive handwriting style prevalent in the historical documents of the Swedish National Archives. It may not perform as well for other handwriting styles, such as block letters or highly stylized scripts.
42
 
43
  ## Training Data
44
+ The Swedish National Archives HTR pipeline was trained using a diverse dataset of running-text documents from the 16th to the 19th century. The training data includes various types of historical texts, such as letters, manuscripts, and official records.
45
 
46
  The dataset comprises both high-quality and challenging examples to ensure the models' robustness. It covers a wide range of handwriting styles, legibility levels, and document conditions.
47
 
 
66
  - **User Feedback**: Users are encouraged to provide feedback on the pipeline's performance, identify issues, and report any potential biases or limitations. This feedback can contribute to refining the pipeline and addressing concerns.
67
 
68
  ## Conclusion
69
+ The Swedish National Archives HTR pipeline offers a powerful solution for automatic handwritten text recognition in running-text documents from the 16th to the 19th century. It leverages instance segmentation models for text region and line extraction, combined with a state-of-the-art SATRN model for accurate recognition. While the pipeline demonstrates high accuracy, speed, and flexibility, users should be aware of its limitations and consider the ethical implications of using historical data. Regular updates and user feedback are crucial for maintaining and improving the pipeline's performance.
70
 
71
  For further inquiries or assistance, please reach out to the Swedish National Archives.
72
 
73
+ ## Demo
74
+ You can try out a demo of the Swedish National Archives HTR pipeline at [Riksarkivet HTR Demo](https://huggingface.co/spaces/Riksarkivet/htr_demo).
75
+
76
  ## References
77
  If you would like to learn more about the Swedish National Archives HTR pipeline or access the training data, please refer to the following resources:
78
 
 
80
  - [MMDetection](https://github.com/open-mmlab/mmdetection)
81
  - [MMOCR](https://github.com/open-mmlab/mmocr)
82
  - [SATRN Paper](https://arxiv.org/abs/2012.05483)
83
+ - [OpenMMLab OCR Toolbox](https://openmmlab.com/mmocr/)