Dochee
/

xlm-roberta-base-finetuned-panx-de

Token Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Dochee commited on Feb 27, 2023

Commit

e57ca99

•

1 Parent(s): 018ef93

Update README.md

Files changed (1) hide show

README.md +10 -2

README.md CHANGED Viewed

@@ -35,8 +35,16 @@ It achieves the following results on the evaluation set:
 - F1: 0.8638
 ## Model description
-More information needed
 ## Intended uses & limitations
@@ -75,4 +83,4 @@ The following hyperparameters were used during training:
 - Transformers 4.26.1
 - Pytorch 1.13.1+cu116
 - Datasets 2.10.0
-- Tokenizers 0.13.2

 - F1: 0.8638
 ## Model description
+Multilingual Named Entity Recognition across several languages
+For this project's token classification, I built a unique custom model head.
+WikiANN or PAN-X.2, a subset of the Cross-lingual TRansfer Evaluation of Multilingual
+Encoders (XTREME) benchmark, was applied. This project was completed for a customer based
+in switzerland, where the four languages that are most frequently spoken are
+German (62.9% of articles), French (22.9%), Italian (8.4%), and English (5.9%).
+Each article is tagged with "inside-outside-beginning" (IOB2) tags for LOC (place),
+PER (person), and ORG (organization).
 ## Intended uses & limitations
 - Transformers 4.26.1
 - Pytorch 1.13.1+cu116
 - Datasets 2.10.0
+- Tokenizers 0.13.2