--- license: mit language: - multilingual library_name: transformers pipeline_tag: text-classification widget: - text: "You wont believe what happened to me today" - text: "You wont believe what happened to me today!" - text: "You wont believe what happened to me today..." - text: "You wont believe what happened to me today <3" - text: "You wont believe what happened to me today :)" - text: "You wont believe what happened to me today :(" --- This is an emotion classification model based on fine-tuning of a Bernice model, which is a pre-trained model trained on multilingual Twitter data. The fine-tuning dataset is a subset of the self-labeled emotion dataset (Lykousas et al., 2019) in English that corresponds to Anger, Fear, Sadness, Joy, and Affection. See the paper, [LEIA: Linguistic Embeddings for the Identification of Affect](https://doi.org/10.1140/epjds/s13688-023-00427-0) for further details. ## Evaluation We evaluated LEIA-multilingual on posts with self-annotated emotion labels identified as non-English using an ensemble of language identification tools. The table below shows the macro-F1 scores aggregated across emotion categories for each language: |Language|Macro-F1| |:---:|:---:| |ar |44.18[43.07,45.29]| |da |65.44[60.96,69.83] | |de |60.47[57.58,63.38] | |es |61.67[60.79,62.55] | |fi |45.1[40.96,49.14] | |fr |65.78[63.19,68.36] | |it |63.37[59.67,67.1] | |pt |57.27[55.15,59.4] | |tl |58.37[55.51,61.23] | |tr |45.42[41.17,49.79]| ## Citation Please cite the following paper if you find the model useful for your work: ```bibtex @article{aroyehun2023leia, title={LEIA: Linguistic Embeddings for the Identification of Affect}, author={Aroyehun, Segun Taofeek and Malik, Lukas and Metzler, Hannah and Haimerl, Nikolas and Di Natale, Anna and Garcia, David}, journal={EPJ Data Science}, volume={12}, year={2023}, publisher={Springer} } ```