--- license: mit datasets: - SkyWater21/ru_go_emotions_ekman - seara/ru_go_emotions language: - ru --- Fine-tuned [rubert-base-cased](https://huggingface.co/DeepPavlov/rubert-base-cased) for multi-label emotion classification task. Model was trained on [ru_go_emotions_ekman](https://huggingface.co/SkyWater21/ru_go_emotions_ekman) dataset. Original translation of comments to Russian was done at [seara/ru_go_emotions](https://huggingface.co/datasets/seara/ru_go_emotions). Dataset is Russian translation of [GoEmotions](https://huggingface.co/datasets/go_emotions) dataset. Google Translate was used to generate the machine translation. Original 26 emotions from GoEmotions were mapped to 6 base emotions as per Dr. Ekman theory. Labels predicted by classifier: ```yaml 0: anger 1: disgust 2: fear 3: joy 4: sadness 5: surprise 6: neutral ``` Label mapping from 27 emotions from GoEmotion to 6 base emotions as per Dr. Ekman theory: |GoEmotion|Ekman| |---|---| | admiration | joy| | amusement | joy| | anger | anger| | annoyance | anger| | approval | joy| | caring | joy| | confusion | surprise| | curiosity | surprise| | desire | joy| | disappointment | sadness| | disapproval | anger| | disgust | disgust| | embarrassment | sadness| | excitement | joy| | fear | fear| | gratitude | joy| | grief | sadness| | joy | joy| | love | joy| | nervousness | fear| | optimism | joy| | pride | joy| | realization | surprise| | relief | joy| | remorse | sadness| | sadness | sadness| | surprise | surprise| | neutral | neutral| Seed used for random number generator is 42: ```python def set_seed(seed=42): random.seed(seed) np.random.seed(seed) torch.manual_seed(seed) if torch.cuda.is_available(): torch.cuda.manual_seed_all(seed) ``` Training parameters: ```yaml max_length: null batch_size: 32 shuffle: True num_workers: 2 pin_memory: False drop_last: False optimizer: adam lr: 0.00001 weight_decay: 0 problem_type: multi_label_classification num_epochs: 4 ``` Evaluation results on test split of [ru_go_emotions_ekman](https://huggingface.co/datasets/SkyWater21/ru_go_emotions_ekman/viewer/simplified_ekman/test) | |Precision|Recall|F1-Score|AUC-ROC|Support| |--------------|---------|------|--------|-------|-------| |anger | 0.56| 0.44| 0.49| 0.86| 726| |disgust | 0.65| 0.24| 0.36| 0.92| 123| |fear | 0.64| 0.60| 0.62| 0.93| 98| |joy | 0.79| 0.80| 0.80| 0.91| 2104| |sadness | 0.68| 0.44| 0.53| 0.89| 379| |surprise | 0.60| 0.52| 0.56| 0.88| 677| |neutral | 0.65| 0.58| 0.61| 0.82| 1787| |micro avg | 0.69| 0.62| 0.65| 0.92| 5894| |macro avg | 0.65| 0.52| 0.57| 0.89| 5894| |weighted avg | 0.69| 0.62| 0.65| 0.87| 5894| |samples avg | 0.65| 0.64| 0.64| nan| 5894|