sijunhe commited on
Commit
f95c278
1 Parent(s): ca723fa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +40 -1
README.md CHANGED
@@ -1,4 +1,43 @@
1
  ---
2
  library_name: paddlenlp
 
 
 
 
3
  ---
4
- # PaddlePaddle/ernie-layoutx-base-uncased
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  library_name: paddlenlp
3
+ license: apache-2.0
4
+ language:
5
+ - en
6
+ - zh
7
  ---
8
+
9
+ [![paddlenlp-banner](https://user-images.githubusercontent.com/1371212/175816733-8ec25eb0-9af3-4380-9218-27c154518258.png)](https://github.com/PaddlePaddle/PaddleNLP)
10
+
11
+ # PaddlePaddle/ernie-layoutx-base-uncased
12
+
13
+ ## Introduction
14
+
15
+ Recent years have witnessed the rise and success of pre-training techniques in visually-rich document understanding.
16
+ However, most existing methods lack the systematic mining and utilization of layout-centered knowledge, leading to sub-optimal performances.
17
+ In this paper, we propose ERNIE-Layout, a novel document pre-training solution with layout knowledge enhancement in the whole workflow,
18
+ to learn better representations that combine the features from text, layout, and image. Specifically, we first rearrange input sequences
19
+ in the serialization stage, and then present a correlative pre-training task, reading order prediction, to learn the proper reading order of documents.
20
+ To improve the layout awareness of the model, we integrate a spatial-aware disentangled attention into the multi-modal transformer and
21
+ a replaced regions prediction task into the pre-training phase. Experimental results show that ERNIE-Layout achieves superior performance
22
+ on various downstream tasks, setting new state-of-the-art on key information extraction, document image classification, and document question answering datasets.
23
+
24
+ More detail: https://arxiv.org/abs/2210.06155
25
+
26
+ ## Available Models
27
+
28
+ - ernie-layoutx-base-uncased
29
+
30
+ ## How to Use?
31
+
32
+ Click on the *Use in paddlenlp* button on the top right!
33
+
34
+ ## Citation Info
35
+
36
+ ```text
37
+ @article{ernie2.0,
38
+ title = {ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding},
39
+ author = {Peng, Qiming and Pan, Yinxu and Wang, Wenjin and Luo, Bin and Zhang, Zhenyu and Huang, Zhengjie and Hu, Teng and Yin, Weichong and Chen, Yongfeng and Zhang, Yin and Feng, Shikun and Sun, Yu and Tian, Hao and Wu, Hua and Wang, Haifeng},
40
+ journal={arXiv preprint arXiv:2210.06155},
41
+ year = {2022},
42
+ }
43
+ ```