manifesto-project/manifestoberta-xlm-roberta-56policy-topics-sentence-2023-1-1

Text Classification


tburst committed on Sep 29, 2023

Commit e0d7cc4 • 1 Parent(s): 2e8148f

Update README.md

Files changed (1)

README.md +4 -4

@@ -3,8 +3,10 @@ license: mit
 ---
 ## Model description
-An xlm-roberta-large model fine-tuned on ~1,6 million annotated statements contained in the manifesto corpus (version 2023a).
-The model can be used to categorize any type of text into 56 different political topics according to the Manifesto Project's coding scheme (Handbook 4).
 ## How to use
@@ -37,8 +39,6 @@ print(predicted_class)
 ```
-## Model Performance
 ## Model Performance
 The model was evaluated on a test set of 199,046 annotated manifesto statements.

 ---
 ## Model description
+An xlm-roberta-large model fine-tuned on ~1,6 million annotated statements contained in the [Manifesto Corpus](https://manifesto-project.wzb.eu/information/documents/corpus) (version 2023a).
+The model can be used to categorize any type of text into 56 different political topics according to the Manifesto Project's coding scheme ([Handbook 4](https://manifesto-project.wzb.eu/coding_schemes/mp_v4)).
+It works for all languages the xlm-roberta model is pretrained on ([overview](https://github.com/facebookresearch/fairseq/tree/main/examples/xlmr#introduction)), just note that it will perform best for the 38 languages contained in the Manifesto Corpus:
 ## How to use
 ```
 ## Model Performance
 The model was evaluated on a test set of 199,046 annotated manifesto statements.
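
The hunk header above shows that the README's usage example ends with `print(predicted_class)`. As a rough sketch of that final step, assuming the classifier returns one raw score (logit) per category for a statement, the scores can be turned into probabilities with a softmax and ranked. The function name `top_categories`, the placeholder labels `topic_0` to `topic_55`, and the example scores are all hypothetical and are not taken from the model or its README; only the count of 56 categories comes from the model description.

```python
import math


def top_categories(logits, id2label, k=3):
    """Return the k most probable (label, probability) pairs for one input."""
    # Numerically stable softmax: subtract the largest logit before exponentiating.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Rank class indices by probability, highest first.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return [(id2label[i], probs[i]) for i in ranked[:k]]


# Hypothetical stand-ins for a 56-way classifier output (not real model data).
NUM_TOPICS = 56
id2label = {i: f"topic_{i}" for i in range(NUM_TOPICS)}  # placeholder label names
logits = [0.0] * NUM_TOPICS
logits[7] = 4.0   # pretend the classifier strongly prefers category 7
logits[21] = 2.5  # with category 21 as the runner-up

for label, prob in top_categories(logits, id2label):
    print(f"{label}: {prob:.3f}")
```

Reporting the top few categories with their probabilities, rather than only the single best one, can help flag statements where the prediction is ambiguous.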