Sentiment Analysis in Spanish


Repository: https://github.com/pysentimiento/pysentimiento/

Model trained on the TASS 2020 corpus (around 5k tweets), which covers several dialects of Spanish. The base model is RoBERTuito, a RoBERTa model trained on Spanish tweets.

Predictions use three labels: POS (positive), NEG (negative), and NEU (neutral).


Use it directly with pysentimiento

from pysentimiento import create_analyzer
analyzer = create_analyzer(task="sentiment", lang="es")

analyzer.predict("Qué gran jugador es Messi")
# returns AnalyzerOutput(output=POS, probas={POS: 0.998, NEG: 0.002, NEU: 0.000})
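
The probabilities in the output above can also be used to reject low-confidence predictions. The following is a minimal sketch, assuming the probabilities are available as a plain dict keyed by label name, as the printed output suggests. The helper `pick_label`, its `threshold` default, and the choice of NEU as the fallback are hypothetical and not part of pysentimiento.

```python
# Hypothetical helper, not part of pysentimiento.
def pick_label(probas, threshold=0.5, fallback="NEU"):
    """Return the most probable label, or `fallback` if it is below `threshold`."""
    label = max(probas, key=probas.get)
    return label if probas[label] >= threshold else fallback


# Probabilities copied from the example output above.
probas = {"POS": 0.998, "NEG": 0.002, "NEU": 0.000}
print(pick_label(probas))  # prints POS
```

Falling back to NEU is only one option; depending on the application it may be better to flag such tweets for manual review.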


Results for the four tasks evaluated in pysentimiento, expressed as Macro F1 scores:

model          emotion        hate_speech    irony          sentiment
robertuito     0.560 ± 0.010  0.759 ± 0.007  0.739 ± 0.005  0.705 ± 0.003
roberta        0.527 ± 0.015  0.741 ± 0.012  0.721 ± 0.008  0.670 ± 0.006
bertin         0.524 ± 0.007  0.738 ± 0.007  0.713 ± 0.012  0.666 ± 0.005
beto_uncased   0.532 ± 0.012  0.727 ± 0.016  0.701 ± 0.007  0.651 ± 0.006
beto_cased     0.516 ± 0.012  0.724 ± 0.012  0.705 ± 0.009  0.662 ± 0.005
mbert_uncased  0.493 ± 0.010  0.718 ± 0.011  0.681 ± 0.010  0.617 ± 0.003
biGRU          0.264 ± 0.007  0.592 ± 0.018  0.631 ± 0.011  0.585 ± 0.011

Note that for Hate Speech, these are the results for SemEval 2019 Task 5, Subtask B.
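
For readers unfamiliar with the metric, Macro F1 is the unweighted mean of the per-class F1 scores, so each class counts equally regardless of how often it occurs. The sketch below computes it in plain Python on a made-up toy sample. The function name `macro_f1` and the data are illustrative assumptions; this is not the evaluation code used to produce the table.

```python
def macro_f1(y_true, y_pred, labels=("POS", "NEG", "NEU")):
    """Unweighted mean of the per-class F1 scores."""
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)  # true positives
        fp = sum(t != label and p == label for t, p in pairs)  # false positives
        fn = sum(t == label and p != label for t, p in pairs)  # false negatives
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never appears.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy data, invented for illustration only.
y_true = ["POS", "POS", "NEG", "NEU"]
y_pred = ["POS", "NEG", "NEG", "NEU"]
print(round(macro_f1(y_true, y_pred), 3))  # prints 0.778
```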


