owinton commited on
Commit
d8c26a0
1 Parent(s): 24bc618

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -4
README.md CHANGED
@@ -7,12 +7,11 @@ f1-score: 0.83
7
  ---
8
  # Model Card for A&ttack2
9
 
10
- Text classification model that determines whether a not a short text contains an attack.
11
-
12
 
13
  # Model Description
14
 
15
- The model is based on the [North-T5-NCC Large](https://huggingface.co/north/t5_large_NCC) (developed by Per E. Kummervold) which is a Scandinavian language built upon [T5](https://github.com/google-research/text-to-text-transfer-transformer) and [T5X](https://github.com/google-research/t5x). The model is further trained on ~70k Norwegian and ~67k Danish social media posts which have been classified as either 'attack' or 'not attack', making it a text-to-text model manipulated to do classification. The model is described in Danish in [this report](https://strapi.ogtal.dk/uploads/966f1ebcfa9942d3aef338e9920611f4.pdf).
16
 
17
 
18
  - **Developed by:** The development team at Analyse & Tal
@@ -32,7 +31,7 @@ This model can be used for classifying Danish and Norwegian social media posts o
32
  [More Information Needed]
33
 
34
  # Training Data
35
- A collection of ~70k Norwegian and ~67k Danish social media posts have been manually annotated as 'attack' or 'not attack' by six individual coders. 5% of the posts have been annotated by more then one annotator, with the annotators in agreement for 83% of annotations.
36
 
37
  [More information needed on the data split method and the training-validation-test split.]
38
 
 
7
  ---
8
  # Model Card for A&ttack2
9
 
10
+ A text classification model for determining if a social media post in Danish or Norwegian contains a verbal attack.
 
11
 
12
  # Model Description
13
 
14
+ The model is based on the [North-T5-NCC Large](https://huggingface.co/north/t5_large_NCC) (developed by Per E. Kummervold) which is a Scandinavian language built upon [T5](https://github.com/google-research/text-to-text-transfer-transformer) and [T5X](https://github.com/google-research/t5x). The model is further trained on ~70k Norwegian and ~67k Danish social media posts which have been classified as either 'verbal attack' or 'nothing', making it a text-to-text model restricted to do classification. The model is described in Danish in [this report](https://strapi.ogtal.dk/uploads/966f1ebcfa9942d3aef338e9920611f4.pdf).
15
 
16
 
17
  - **Developed by:** The development team at Analyse & Tal
 
31
  [More Information Needed]
32
 
33
  # Training Data
34
+ A collection of ~70k Norwegian and ~67k Danish social media posts have been manually annotated as 'verbal attack' or 'nothing' by annotators. 5% of the posts have been annotated by more then one annotator, with the annotators in agreement for 83% of annotations.
35
 
36
  [More information needed on the data split method and the training-validation-test split.]
37