microsoft/layoutlmv2-base-uncased

Yiheng Xu committed on May 20, 2021

Commit 3356537 • 1 parent: 9bd21eb

Update README.md

Files changed (1)

README.md +7 -0

@@ -1,6 +1,13 @@
 # LayoutLMv2
 **Multimodal (text + layout/format + image) pre-training for document AI**
 ## Introduction
 LayoutLMv2 is an improved version of LayoutLM with new pre-training tasks to model the interaction among text, layout, and image in a single multi-modal framework. It outperforms strong baselines and achieves new state-of-the-art results on a wide variety of downstream visually-rich document understanding tasks, including FUNSD (0.7895 → 0.8420), CORD (0.9493 → 0.9601), SROIE (0.9524 → 0.9781), Kleister-NDA (0.834 → 0.852), RVL-CDIP (0.9443 → 0.9564), and DocVQA (0.7295 → 0.8672).

+---
+language: en
+license: cc-by-sa-4.0
+---
 # LayoutLMv2
 **Multimodal (text + layout/format + image) pre-training for document AI**
+[GitHub Repository](https://github.com/microsoft/unilm/tree/master/layoutlmv2)
 ## Introduction
 LayoutLMv2 is an improved version of LayoutLM with new pre-training tasks to model the interaction among text, layout, and image in a single multi-modal framework. It outperforms strong baselines and achieves new state-of-the-art results on a wide variety of downstream visually-rich document understanding tasks, including FUNSD (0.7895 → 0.8420), CORD (0.9493 → 0.9601), SROIE (0.9524 → 0.9781), Kleister-NDA (0.834 → 0.852), RVL-CDIP (0.9443 → 0.9564), and DocVQA (0.7295 → 0.8672).
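
Reviewer note: the score pairs in the introduction are easier to compare as absolute gains. The sketch below is not part of the commit; it copies the six pairs from the README text and assumes each arrow reads as earlier score → LayoutLMv2 score. The README does not state the metric for each benchmark, so the gains are plain differences of the reported numbers. The names `SCORES` and `absolute_gains` are made up for illustration.

```python
# Score pairs copied from the README introduction.
# Assumption: each pair is (earlier score, LayoutLMv2 score).
SCORES = {
    "FUNSD": (0.7895, 0.8420),
    "CORD": (0.9493, 0.9601),
    "SROIE": (0.9524, 0.9781),
    "Kleister-NDA": (0.834, 0.852),
    "RVL-CDIP": (0.9443, 0.9564),
    "DocVQA": (0.7295, 0.8672),
}


def absolute_gains(scores):
    """Return {task: new - old}, rounded to 4 decimal places."""
    return {task: round(new - old, 4) for task, (old, new) in scores.items()}


if __name__ == "__main__":
    gains = absolute_gains(SCORES)
    # Print tasks from largest to smallest gain.
    for task, gain in sorted(gains.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{task:<13} +{gain:.4f}")
```

Under that reading, DocVQA shows the largest difference and CORD the smallest.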