nielsr
/

layoutxlm-finetuned-xfund-fr

Token Classification

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

nielsr HF staff commited on Sep 19, 2022

Commit

7965c30

•

1 Parent(s): 18ced62

Update README.md

Files changed (1) hide show

README.md +22 -4

README.md CHANGED Viewed

@@ -13,17 +13,35 @@ model-index:
 This model is a fine-tuned version of [microsoft/layoutxlm-base](https://huggingface.co/microsoft/layoutxlm-base) on the [XUND](https://github.com/doc-analysis/XFUND) dataset (French split).
-## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure

 This model is a fine-tuned version of [microsoft/layoutxlm-base](https://huggingface.co/microsoft/layoutxlm-base) on the [XUND](https://github.com/doc-analysis/XFUND) dataset (French split).
+## Model usage
+Here's how to use this model:
+```
+from transformers import AutoProcessor, AutoModelForTokenClassification
+from PIL import Image
+processor = AutoProcessor.from_pretrained("nielsr/layoutxlm-finetuned-xfund-fr")
+model = AutoModelForTokenClassification.from_pretrained(nielsr/layoutxlm-finetuned-xfund-fr")
+# assuming you have a French document, turned into an image
+image = Image("...").convert("RGB")
+# prepare for the model
+encoding = processor(image, return_tensors="pt")
+with torch.no_grad():
+  outputs = model(**encoding)
+  logits = outputs.logits
+```
 ## Intended uses & limitations
+This model can be used for NER on French scanned documents. It can recognize 4 categories: "question", "answer", "header" and "other".
 ## Training and evaluation data
+This checkpoint used the French portion of the multilingual [XUND](https://github.com/doc-analysis/XFUND) dataset.
 ## Training procedure