DILHTWD committed on
Commit
e0129a4
1 Parent(s): d88fc0e

Update README.md

Files changed (1)
  1. README.md +2 -2
README.md CHANGED
@@ -12,12 +12,12 @@ metrics:
 
 This model was developed to address the challenges of Document Layout Segmentation and Document Layout Analysis by accurately segmenting a document page into its core components. These components include the title, captions, footnotes, formulas, list items, page footers, page headers, and pictures. The motivation behind creating this model stems from the need to enhance the understanding and accessibility of document content, facilitating a wide range of applications such as automated content extraction, document summarization, and improved accessibility features. By providing precise segmentation of these elements, the model aims to support various downstream tasks that rely on the structural understanding of document layouts, enabling more efficient and effective processing and analysis of document content.
 
-##Training Data:
+## Training Data:
 - **Source:** DocLayNet, IBM (https://github.com/DS4SD/DocLayNet)
 - **Classes:** 11 classes (Caption, Footnote, Formula, List-item, Page-footer, Page-header, Picture, Section-header, Table, Text, and Title)
 - **Pages:** 80,863 document pages
 
-##Performance
+## Performance
 Metrics:
 - **Precision:** 0.98
 - **Recall:** 0.97