README.md · amengemeda/amharic-hate-speech-detection-mBERT at 2d4a8380211d336d6b937e0c67bc41026736011e

Amharic Hate Speech Detection using Fine-tuned mBERT

Model description

This model was created by finetuning the mBERT model for the downstream task of Hate speech detection for the Amharic language. The initial mBERT model used for finetuning is Davlan/bert-base-multilingual-cased-finetuned-amharic which was provided by Davlan on Huggingface. The model was fine-tuned using HuggingFace's Trainer API. The final result of the finetuning has an F1-score of 0.9172 and an accuracy of 91.59%.

Dataset description The finetuning was done on an Amharic Dataset that was made available by Mendeley Data (https://data.mendeley.com/datasets/ymtmxx385m). It has a size of 30,000 rows.

Other The Google Colab notebook is made available on my GitHub. Check this path https://github.com/amengemeda/ISproject-2/blob/main/mBERT/Amharic_Hate_Speech_detection_using_mBERT_(Trainer_API).ipynb

amengemeda
/

amharic-hate-speech-detection-mBERT

language: - amh tags: - amharic - hate speech - sentiment analysis datasets: - https://data.mendeley.com/datasets/ymtmxx385m metrics: - F1 - Accuracy