KennethTM commited on
Commit
6b60f9c
1 Parent(s): 4af0d75

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -17
README.md CHANGED
@@ -49,20 +49,34 @@ Initially, only the word token embeddings are trained using 1.000.000 samples. F
49
 
50
  The performance of the pretrained model was evaluated using [ScandEval](https://github.com/ScandEval/ScandEval).
51
 
52
- | task | dataset | summary |
53
- |:-------------------------|:-------------|:-------------------------------------------------------------------------------------------|
54
- | sentiment-classification | swerec | mcc = 63.02, mcc_se = 2.16, macro_f1 = 62.2, macro_f1_se = 3.61 |
55
- | sentiment-classification | angry-tweets | mcc = 47.21, mcc_se = 0.53, macro_f1 = 64.21, macro_f1_se = 0.53 |
56
- | sentiment-classification | norec | mcc = 42.23, mcc_se = 8.69, macro_f1 = 57.24, macro_f1_se = 7.67 |
57
- | named-entity-recognition | suc3 | micro_f1 = 50.03, micro_f1_se = 4.16, micro_f1_no_misc = 53.55, micro_f1_no_misc_se = 4.57 |
58
- | named-entity-recognition | dane | micro_f1 = 76.44, micro_f1_se = 1.36, micro_f1_no_misc = 80.61, micro_f1_no_misc_se = 1.11 |
59
- | named-entity-recognition | norne-nb | micro_f1 = 68.38, micro_f1_se = 1.72, micro_f1_no_misc = 73.08, micro_f1_no_misc_se = 1.66 |
60
- | named-entity-recognition | norne-nn | micro_f1 = 60.45, micro_f1_se = 1.71, micro_f1_no_misc = 64.39, micro_f1_no_misc_se = 1.8 |
61
- | linguistic-acceptability | scala-sv | mcc = 5.01, mcc_se = 5.41, macro_f1 = 49.46, macro_f1_se = 3.67 |
62
- | linguistic-acceptability | scala-da | mcc = 54.74, mcc_se = 12.22, macro_f1 = 76.25, macro_f1_se = 6.09 |
63
- | linguistic-acceptability | scala-nb | mcc = 19.18, mcc_se = 14.01, macro_f1 = 55.3, macro_f1_se = 8.85 |
64
- | linguistic-acceptability | scala-nn | mcc = 5.72, mcc_se = 5.91, macro_f1 = 49.56, macro_f1_se = 3.73 |
65
- | question-answering | scandiqa-da | em = 26.36, em_se = 1.17, f1 = 32.41, f1_se = 1.1 |
66
- | question-answering | scandiqa-no | em = 26.14, em_se = 1.59, f1 = 32.02, f1_se = 1.59 |
67
- | question-answering | scandiqa-sv | em = 26.38, em_se = 1.1, f1 = 32.33, f1_se = 1.05 |
68
- | speed | speed | speed = 4.55, speed_se = 0.0 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
49
 
50
  The performance of the pretrained model was evaluated using [ScandEval](https://github.com/ScandEval/ScandEval).
51
 
52
+ | Task | Dataset | Score (±SE) |
53
+ |:-------------------------|:-------------|:---------------------------------|
54
+ | sentiment-classification | swerec | mcc = 63.02 2.16) |
55
+ | | | macro_f1 = 62.2 (±3.61) |
56
+ | sentiment-classification | angry-tweets | mcc = 47.21 (±0.53) |
57
+ | | | macro_f1 = 64.21 (±0.53) |
58
+ | sentiment-classification | norec | mcc = 42.23 (±8.69) |
59
+ | | | macro_f1 = 57.24 (±7.67) |
60
+ | named-entity-recognition | suc3 | micro_f1 = 50.03 (±4.16) |
61
+ | | | micro_f1_no_misc = 53.55 (±4.57) |
62
+ | named-entity-recognition | dane | micro_f1 = 76.44 (±1.36) |
63
+ | | | micro_f1_no_misc = 80.61 (±1.11) |
64
+ | named-entity-recognition | norne-nb | micro_f1 = 68.38 (±1.72) |
65
+ | | | micro_f1_no_misc = 73.08 1.66) |
66
+ | named-entity-recognition | norne-nn | micro_f1 = 60.45 1.71) |
67
+ | | | micro_f1_no_misc = 64.39 1.8) |
68
+ | linguistic-acceptability | scala-sv | mcc = 5.01 (±5.41) |
69
+ | | | macro_f1 = 49.46 (±3.67) |
70
+ | linguistic-acceptability | scala-da | mcc = 54.74 (±12.22) |
71
+ | | | macro_f1 = 76.25 (±6.09) |
72
+ | linguistic-acceptability | scala-nb | mcc = 19.18 (±14.01) |
73
+ | | | macro_f1 = 55.3 (±8.85) |
74
+ | linguistic-acceptability | scala-nn | mcc = 5.72 (±5.91) |
75
+ | | | macro_f1 = 49.56 (±3.73) |
76
+ | question-answering | scandiqa-da | em = 26.36 (±1.17) |
77
+ | | | f1 = 32.41 (±1.1) |
78
+ | question-answering | scandiqa-no | em = 26.14 (±1.59) |
79
+ | | | f1 = 32.02 (±1.59) |
80
+ | question-answering | scandiqa-sv | em = 26.38 (±1.1) |
81
+ | | | f1 = 32.33 (±1.05) |
82
+ | speed | speed | speed = 4.55 (±0.0) |