Update README.md
Browse files
README.md
CHANGED
@@ -3,8 +3,10 @@ license: mit
|
|
3 |
---
|
4 |
|
5 |
## Model description
|
6 |
-
An xlm-roberta-large model fine-tuned on ~1,6 million annotated statements contained in the manifesto
|
7 |
-
The model can be used to categorize any type of text into 56 different political topics according to the Manifesto Project's coding scheme (Handbook 4).
|
|
|
|
|
8 |
|
9 |
## How to use
|
10 |
|
@@ -37,8 +39,6 @@ print(predicted_class)
|
|
37 |
```
|
38 |
|
39 |
|
40 |
-
## Model Performance
|
41 |
-
|
42 |
## Model Performance
|
43 |
|
44 |
The model was evaluated on a test set of 199,046 annotated manifesto statements.
|
|
|
3 |
---
|
4 |
|
5 |
## Model description
|
6 |
+
An xlm-roberta-large model fine-tuned on ~1,6 million annotated statements contained in the [Manifesto Corpus](https://manifesto-project.wzb.eu/information/documents/corpus) (version 2023a).
|
7 |
+
The model can be used to categorize any type of text into 56 different political topics according to the Manifesto Project's coding scheme ([Handbook 4](https://manifesto-project.wzb.eu/coding_schemes/mp_v4)).
|
8 |
+
It works for all languages the xlm-roberta model is pretrained on ([overview](https://github.com/facebookresearch/fairseq/tree/main/examples/xlmr#introduction)), just note that it will perform best for the 38 languages contained in the Manifesto Corpus:
|
9 |
+
|
10 |
|
11 |
## How to use
|
12 |
|
|
|
39 |
```
|
40 |
|
41 |
|
|
|
|
|
42 |
## Model Performance
|
43 |
|
44 |
The model was evaluated on a test set of 199,046 annotated manifesto statements.
|