poltextlab committed
Commit 02dea4d
Parent: 38c4d78

Upload README.md with huggingface_hub

Files changed (1): README.md (+27 -28)
README.md CHANGED
@@ -3,7 +3,7 @@
 ---
 license: mit
 language:
-- multilingual
+- de
 tags:
 - zero-shot-classification
 - text-classification
@@ -14,7 +14,7 @@ metrics:
 ---
 # xlm-roberta-large-german-cap
 ## Model description
-An `xlm-roberta-large` model finetuned on multilingual training data labelled with [major topic codes](https://www.comparativeagendas.net/pages/master-codebook) from the [Comparative Agendas Project](https://www.comparativeagendas.net/).
+An `xlm-roberta-large` model finetuned on German training data labelled with [major topic codes](https://www.comparativeagendas.net/pages/master-codebook) from the [Comparative Agendas Project](https://www.comparativeagendas.net/).
 
 ## How to use the model
 #### Loading and tokenizing input data
@@ -90,34 +90,33 @@ training_args = TrainingArguments(
 We also incorporated an EarlyStoppingCallback in the process with a patience of 2 epochs.
 
 ## Model performance
-The model was evaluated on a test set of 5227 examples (10% of the available data).<br>
-Model accuracy is **0.7**.
+The model was evaluated on a test set of 601 examples (10% of the available data).<br>
+Model accuracy is **0.71**.
 | label        |   precision |   recall |   f1-score |   support |
 |:-------------|------------:|---------:|-----------:|----------:|
-| 0 | 0.62 | 0.65 | 0.63 | 510 |
-| 1 | 0.73 | 0.67 | 0.7 | 406 |
-| 2 | 0.8 | 0.83 | 0.81 | 239 |
-| 3 | 0.75 | 0.78 | 0.76 | 136 |
-| 4 | 0.72 | 0.65 | 0.68 | 386 |
-| 5 | 0.79 | 0.82 | 0.81 | 331 |
-| 6 | 0.76 | 0.65 | 0.7 | 264 |
-| 7 | 0.81 | 0.79 | 0.8 | 189 |
-| 8 | 0.8 | 0.8 | 0.8 | 121 |
-| 9 | 0.78 | 0.83 | 0.81 | 151 |
-| 10 | 0.72 | 0.66 | 0.69 | 304 |
-| 11 | 0.67 | 0.67 | 0.67 | 452 |
-| 12 | 0.68 | 0.72 | 0.7 | 184 |
-| 13 | 0.54 | 0.64 | 0.59 | 269 |
-| 14 | 0.78 | 0.7 | 0.74 | 212 |
-| 15 | 0.63 | 0.77 | 0.69 | 154 |
-| 16 | 0.53 | 0.57 | 0.55 | 61 |
-| 17 | 0.75 | 0.71 | 0.73 | 471 |
-| 18 | 0.62 | 0.62 | 0.62 | 301 |
-| 19 | 0.27 | 0.5 | 0.35 | 8 |
-| 20 | 0.79 | 0.82 | 0.81 | 78 |
-| 21 | 0 | 0 | 0 | 0 |
-| macro avg | 0.66 | 0.68 | 0.67 | 5227 |
-| weighted avg | 0.71 | 0.7 | 0.7 | 5227 |
+| 0 | 0.92 | 0.65 | 0.76 | 17 |
+| 1 | 0.77 | 0.84 | 0.8 | 61 |
+| 2 | 0.76 | 0.76 | 0.76 | 21 |
+| 3 | 0.82 | 0.82 | 0.82 | 17 |
+| 4 | 0.61 | 0.56 | 0.58 | 25 |
+| 5 | 0.5 | 0.56 | 0.53 | 9 |
+| 6 | 0.71 | 0.76 | 0.74 | 46 |
+| 7 | 0.62 | 0.83 | 0.71 | 24 |
+| 8 | 0.57 | 0.77 | 0.65 | 22 |
+| 9 | 0.85 | 0.85 | 0.85 | 47 |
+| 10 | 0.71 | 0.61 | 0.66 | 36 |
+| 11 | 0.36 | 0.36 | 0.36 | 14 |
+| 12 | 0.73 | 0.89 | 0.8 | 9 |
+| 13 | 0.59 | 0.62 | 0.6 | 21 |
+| 14 | 0.76 | 0.83 | 0.8 | 66 |
+| 15 | 0.56 | 0.56 | 0.56 | 16 |
+| 16 | 1 | 0.14 | 0.25 | 7 |
+| 17 | 0.75 | 0.72 | 0.73 | 61 |
+| 18 | 0.72 | 0.64 | 0.68 | 77 |
+| 19 | 0 | 0 | 0 | 3 |
+| 20 | 0 | 0 | 0 | 2 |
+| macro avg | 0.63 | 0.61 | 0.6 | 601 |
+| weighted avg | 0.71 | 0.71 | 0.71 | 601 |
 
 ## Inference platform
 This model is used by the [CAP Babel Machine](https://babel.poltextlab.com), an open-source and free natural language processing tool, designed to simplify and speed up projects for comparative research.
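
The diff shows only the changed hunks, so the code the README keeps under "How to use the model" / "Loading and tokenizing input data" is not visible here. As a rough sketch of how the published model would typically be loaded and queried through the transformers API (the repository id `poltextlab/xlm-roberta-large-german-cap`, the choice of the base `xlm-roberta-large` tokenizer, and the tokenization settings are assumptions, not taken from this commit):

```python
# Illustrative sketch only -- not the README's snippet. Assumes the model is published
# as "poltextlab/xlm-roberta-large-german-cap" and preprocessed with the base tokenizer.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-large")
model = AutoModelForSequenceClassification.from_pretrained("poltextlab/xlm-roberta-large-german-cap")

text = "Der Bundestag debattiert über die Reform der gesetzlichen Krankenversicherung."
inputs = tokenizer(text, truncation=True, max_length=256, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

pred = logits.argmax(dim=-1).item()
print(pred, model.config.id2label.get(pred))  # predicted CAP major topic index and its label name
```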
 
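The card notes that an `EarlyStoppingCallback` with a patience of 2 epochs was added to the training loop, and the truncated hunk header shows a `TrainingArguments(` block. Below is a minimal sketch of how such a callback is typically configured; every argument value is an assumption, since the full `TrainingArguments` are not part of this diff:

```python
# Illustrative sketch: wiring early stopping with a patience of 2 into a transformers
# training run. All TrainingArguments values are assumed, not taken from the README.
from transformers import EarlyStoppingCallback, TrainingArguments

training_args = TrainingArguments(
    output_dir="xlm-roberta-large-german-cap",  # hypothetical output path
    eval_strategy="epoch",            # named `evaluation_strategy` in older transformers releases
    save_strategy="epoch",
    load_best_model_at_end=True,      # required by EarlyStoppingCallback
    metric_for_best_model="eval_loss",
)

# Stops finetuning once the monitored metric fails to improve for 2 consecutive evaluations,
# matching the "patience of 2 epochs" described above. It is attached to the run via
# Trainer(..., args=training_args, callbacks=[early_stopping]).
early_stopping = EarlyStoppingCallback(early_stopping_patience=2)
```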
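The accuracy figure and the per-label precision/recall/f1 rows with macro and weighted averages in the new table follow the layout of a standard classification report. Whether the authors used scikit-learn is an assumption, but a report of this shape is typically produced as follows (`y_true` and `y_pred` are placeholders for the test-set gold CAP codes and the model's predictions):

```python
# Illustrative only: generate the kind of report shown in the performance table.
from sklearn.metrics import accuracy_score, classification_report

y_true = [0, 1, 1, 2, 2, 2]  # placeholder gold CAP major-topic indices from a test split
y_pred = [0, 1, 2, 2, 2, 1]  # placeholder model predictions on the same examples

print("accuracy:", round(accuracy_score(y_true, y_pred), 2))
print(classification_report(y_true, y_pred, digits=2))  # per-label rows plus macro / weighted avg
```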