tburst commited on
Commit
e0d7cc4
1 Parent(s): 2e8148f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -3,8 +3,10 @@ license: mit
3
  ---
4
 
5
  ## Model description
6
- An xlm-roberta-large model fine-tuned on ~1,6 million annotated statements contained in the manifesto corpus (version 2023a).
7
- The model can be used to categorize any type of text into 56 different political topics according to the Manifesto Project's coding scheme (Handbook 4).
 
 
8
 
9
  ## How to use
10
 
@@ -37,8 +39,6 @@ print(predicted_class)
37
  ```
38
 
39
 
40
- ## Model Performance
41
-
42
  ## Model Performance
43
 
44
  The model was evaluated on a test set of 199,046 annotated manifesto statements.
 
3
  ---
4
 
5
  ## Model description
6
+ An xlm-roberta-large model fine-tuned on ~1,6 million annotated statements contained in the [Manifesto Corpus](https://manifesto-project.wzb.eu/information/documents/corpus) (version 2023a).
7
+ The model can be used to categorize any type of text into 56 different political topics according to the Manifesto Project's coding scheme ([Handbook 4](https://manifesto-project.wzb.eu/coding_schemes/mp_v4)).
8
+ It works for all languages the xlm-roberta model is pretrained on ([overview](https://github.com/facebookresearch/fairseq/tree/main/examples/xlmr#introduction)), just note that it will perform best for the 38 languages contained in the Manifesto Corpus:
9
+
10
 
11
  ## How to use
12
 
 
39
  ```
40
 
41
 
 
 
42
  ## Model Performance
43
 
44
  The model was evaluated on a test set of 199,046 annotated manifesto statements.