SkyWater21 committed on
Commit
d578b41
1 Parent(s): c79e7a3

Update README.md

  1. README.md +100 -0
README.md CHANGED
@@ -1,3 +1,103 @@
  ---
  license: mit
+ datasets:
+ - SkyWater21/ru_go_emotions_ekman
+ - seara/ru_go_emotions
+ language:
+ - ru
  ---
+ A fine-tuned [rubert-base-cased](https://huggingface.co/DeepPavlov/rubert-base-cased) model for multi-label emotion classification.
+
+ The model was trained on the [ru_go_emotions_ekman](https://huggingface.co/SkyWater21/ru_go_emotions_ekman) dataset.
+ The original translation of the comments into Russian comes from [seara/ru_go_emotions](https://huggingface.co/datasets/seara/ru_go_emotions).
+ That dataset is a Russian translation of the [GoEmotions](https://huggingface.co/datasets/go_emotions) dataset.
+ Google Translate was used to generate the machine translation.
+
+ The original 27 emotions from GoEmotions were mapped to 6 base emotions as per Dr. Ekman's theory; neutral is kept as a separate label.
+
+ Labels predicted by the classifier:
+ ```yaml
+ 0: anger
+ 1: disgust
+ 2: fear
+ 3: joy
+ 4: sadness
+ 5: surprise
+ 6: neutral
+ ```
+
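Because the task is multi-label, each of the seven outputs is scored independently, so a text can receive several labels or none. The sketch below decodes one row of logits using the index-to-label list above. It assumes a sigmoid activation and a 0.5 decision threshold; the threshold is an assumption, since the card does not state one, and `decode_logits` and the example logits are hypothetical.

```python
import math

# Index-to-label mapping taken from the label list above.
ID2LABEL = {
    0: "anger", 1: "disgust", 2: "fear", 3: "joy",
    4: "sadness", 5: "surprise", 6: "neutral",
}


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def decode_logits(logits, threshold=0.5):
    """Return (label, probability) pairs whose sigmoid score reaches the threshold.

    The 0.5 default is an assumption, not a value documented in the card.
    """
    if len(logits) != len(ID2LABEL):
        raise ValueError("expected one logit per label")
    scores = [sigmoid(x) for x in logits]
    return [(ID2LABEL[i], round(p, 3)) for i, p in enumerate(scores) if p >= threshold]


# Hypothetical logits for a single comment (not produced by the real model).
example_logits = [-2.0, -3.0, -4.0, 1.5, -1.0, 0.3, -0.5]
predicted = decode_logits(example_logits)
print(predicted)
```

Lowering the threshold trades precision for recall, which may matter for the low-recall classes in the evaluation table.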
+ Label mapping from the 27 GoEmotions emotions (plus neutral) to the 6 base emotions of Dr. Ekman's theory:
+ |GoEmotions|Ekman|
+ |---|---|
+ | admiration | joy |
+ | amusement | joy |
+ | anger | anger |
+ | annoyance | anger |
+ | approval | joy |
+ | caring | joy |
+ | confusion | surprise |
+ | curiosity | surprise |
+ | desire | joy |
+ | disappointment | sadness |
+ | disapproval | anger |
+ | disgust | disgust |
+ | embarrassment | sadness |
+ | excitement | joy |
+ | fear | fear |
+ | gratitude | joy |
+ | grief | sadness |
+ | joy | joy |
+ | love | joy |
+ | nervousness | fear |
+ | optimism | joy |
+ | pride | joy |
+ | realization | surprise |
+ | relief | joy |
+ | remorse | sadness |
+ | sadness | sadness |
+ | surprise | surprise |
+ | neutral | neutral |
+
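The mapping table above can be written as a lookup. This is a minimal sketch, not the author's preprocessing code: the names `EKMAN_GROUPS`, `GO_TO_EKMAN`, and `to_ekman` are hypothetical, and the groups are copied from the table.

```python
# Ekman label -> GoEmotions labels, copied from the mapping table above.
EKMAN_GROUPS = {
    "anger": ["anger", "annoyance", "disapproval"],
    "disgust": ["disgust"],
    "fear": ["fear", "nervousness"],
    "joy": [
        "admiration", "amusement", "approval", "caring", "desire", "excitement",
        "gratitude", "joy", "love", "optimism", "pride", "relief",
    ],
    "sadness": ["disappointment", "embarrassment", "grief", "remorse", "sadness"],
    "surprise": ["confusion", "curiosity", "realization", "surprise"],
    "neutral": ["neutral"],
}

# Inverted view: GoEmotions label -> Ekman label.
GO_TO_EKMAN = {
    fine: coarse for coarse, fines in EKMAN_GROUPS.items() for fine in fines
}


def to_ekman(go_labels):
    """Map a list of GoEmotions labels to unique Ekman labels, keeping first-seen order."""
    seen = []
    for label in go_labels:
        coarse = GO_TO_EKMAN[label]  # raises KeyError for unknown labels
        if coarse not in seen:
            seen.append(coarse)
    return seen


print(to_ekman(["admiration", "gratitude", "annoyance"]))
```

Several fine-grained labels on one comment can collapse into a single Ekman label, so a multi-label example may end up with fewer labels after mapping.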
+ The seed used for the random number generators is 42:
+ ```python
+ import random
+ import numpy as np
+ import torch
+
+ def set_seed(seed=42):
+     # Seed the Python, NumPy, and PyTorch generators (all CUDA devices if present)
+     random.seed(seed)
+     np.random.seed(seed)
+     torch.manual_seed(seed)
+     if torch.cuda.is_available():
+         torch.cuda.manual_seed_all(seed)
+ ```
+
+ Training parameters:
+ ```yaml
+ max_length: null
+ batch_size: 32
+ shuffle: True
+ num_workers: 2
+ pin_memory: False
+ drop_last: False
+
+ optimizer: adam
+ lr: 0.00001
+ weight_decay: 0
+
+ problem_type: multi_label_classification
+
+ num_epochs: 4
+ ```
+
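Assuming `problem_type` is passed through to a Hugging Face `transformers` sequence-classification model, `multi_label_classification` selects a binary cross-entropy loss on raw logits (`BCEWithLogitsLoss`), which treats each label as an independent yes/no decision. The pure-Python sketch below illustrates that loss for a single example. The function name and the sample values are hypothetical, and this is not the training code used for the model.

```python
import math


def bce_with_logits(logits, targets):
    """Mean binary cross-entropy over independent labels, computed from raw logits.

    Uses the stable form max(x, 0) - x*y + log(1 + exp(-|x|)), which equals
    -[y*log(s) + (1-y)*log(1-s)] with s = sigmoid(x).
    """
    if len(logits) != len(targets):
        raise ValueError("logits and targets must have the same length")
    total = 0.0
    for x, y in zip(logits, targets):
        total += max(x, 0.0) - x * y + math.log1p(math.exp(-abs(x)))
    return total / len(logits)


# Hypothetical example: multi-hot target marking joy (index 3) and surprise (index 5).
logits = [-2.0, -3.0, -4.0, 1.5, -1.0, 0.3, -0.5]
targets = [0.0, 0.0, 0.0, 1.0, 0.0, 1.0, 0.0]
loss = bce_with_logits(logits, targets)
print(f"loss = {loss:.4f}")
```

With all logits at zero the loss equals ln 2 per label, which is a handy sanity check for an untrained classification head.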
+ Evaluation results on the test split of [ru_go_emotions_ekman](https://huggingface.co/datasets/SkyWater21/ru_go_emotions_ekman/viewer/simplified_ekman/test):
+ |Label         |Precision|Recall|F1-Score|AUC-ROC|Support|
+ |--------------|---------|------|--------|-------|-------|
+ |anger         |     0.56|  0.44|    0.49|   0.86|    726|
+ |disgust       |     0.65|  0.24|    0.36|   0.92|    123|
+ |fear          |     0.64|  0.60|    0.62|   0.93|     98|
+ |joy           |     0.79|  0.80|    0.80|   0.91|   2104|
+ |sadness       |     0.68|  0.44|    0.53|   0.89|    379|
+ |surprise      |     0.60|  0.52|    0.56|   0.88|    677|
+ |neutral       |     0.65|  0.58|    0.61|   0.82|   1787|
+ |micro avg     |     0.69|  0.62|    0.65|   0.92|   5894|
+ |macro avg     |     0.65|  0.52|    0.57|   0.89|   5894|
+ |weighted avg  |     0.69|  0.62|    0.65|   0.87|   5894|
+ |samples avg   |     0.65|  0.64|    0.64|    nan|   5894|
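The macro and weighted rows follow the usual definitions: the macro average is the unweighted mean over the seven labels, and the weighted average uses each label's support as its weight. The sketch below recomputes the F1 averages from the rounded per-label values in the table. The helper names are hypothetical, and this is an arithmetic check, not the original evaluation script.

```python
# (precision, recall, f1, support) per label, copied from the table above.
REPORT = {
    "anger": (0.56, 0.44, 0.49, 726),
    "disgust": (0.65, 0.24, 0.36, 123),
    "fear": (0.64, 0.60, 0.62, 98),
    "joy": (0.79, 0.80, 0.80, 2104),
    "sadness": (0.68, 0.44, 0.53, 379),
    "surprise": (0.60, 0.52, 0.56, 677),
    "neutral": (0.65, 0.58, 0.61, 1787),
}
F1, SUPPORT = 2, 3  # column indices into each row


def macro_average(report, column):
    """Unweighted mean of one metric over all labels."""
    return sum(row[column] for row in report.values()) / len(report)


def weighted_average(report, column):
    """Mean of one metric over all labels, weighted by support."""
    total = sum(row[SUPPORT] for row in report.values())
    return sum(row[column] * row[SUPPORT] for row in report.values()) / total


total_support = sum(row[SUPPORT] for row in REPORT.values())
macro_f1 = macro_average(REPORT, F1)
weighted_f1 = weighted_average(REPORT, F1)
print(total_support, round(macro_f1, 2), round(weighted_f1, 2))
```

With the rounded inputs this gives a total support of 5894, a macro F1 of about 0.57, and a weighted F1 of about 0.65, matching the reported rows. Other cells recomputed this way can differ in the last digit because the per-label values are already rounded.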