Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

cardiffnlp/twitter-xlm-roberta-base-hate-spanish

This model is a fine-tuned version of cardiffnlp/twitter-xlm-roberta-base using the HaterNet dataset and the Spanish subset of SemEval-2019 Task 5.

Following metrics are achieved

  • on the test split of SemEval-2019 Task 5

    • F1 (weighted): 0.7866
    • F1 (macro): 0.7935
    • Accuracy: 0.7937
  • on custom test split of Haternet

    • F1 (weighted): 0.7815
    • F1 (macro): 0.6981
    • Accuracy: 0.7933
  • on Haternet & SemEval-2019 Task 5

    • F1 (weighted): 0.7908
    • F1 (macro): 0.7657
    • Accuracy: 0.7936

Usage

Install tweetnlp via pip.

pip install tweetnlp

Load the model in python.

import tweetnlp
model = tweetnlp.Classifier("cardiffnlp/twitter-xlm-roberta-base-hate-spanish")
model.predict('Ismael es egocentrico porque se vuelve loca si le dicen que tiene el pelo bonito๐Ÿ˜‚๐Ÿ˜‚๐Ÿ˜‚๐Ÿ˜‚ eso se define con otro objetivo #FirstDates251')
>> {'label': 'NOT-HATE'}

Datasets

@inproceedings{basile-etal-2019-semeval, title = "{S}em{E}val-2019 Task 5: Multilingual Detection of Hate Speech Against Immigrants and Women in {T}witter", author = "Basile, Valerio and Bosco, Cristina and Fersini, Elisabetta and Nozza, Debora and Patti, Viviana and Rangel Pardo, Francisco Manuel and Rosso, Paolo and Sanguinetti, Manuela", booktitle = "Proceedings of the 13th International Workshop on Semantic Evaluation", month = jun, year = "2019", address = "Minneapolis, Minnesota, USA", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/S19-2007", doi = "10.18653/v1/S19-2007", pages = "54--63", }

@article{quijano2019haternet, title={HaterNet a system for detecting and analyzing hate speech in Twitter (Version 1.0)[Data set]}, author={Quijano-Sanchez, Lara and Kohatsu, Juan Carlos Pereira and Liberatore, Federico and Camacho-Collados, Miguel}, journal={Zenodo}, year={2019} }

Downloads last month
18
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Collection including cardiffnlp/twitter-xlm-roberta-base-hate-spanish