language:
  - ms
library_name: transformers

Safe for Work Classifier Model for Malaysian Data

The current version supports Malay. We are working towards supporting Malay, English, and Indonesian.

The model is fine-tuned from the base model https://huggingface.co/mesolitica/malaysian-mistral-191M-MLM-512 on Malaysian NSFW data.

Data Source: https://huggingface.co/datasets/malaysia-ai/Malaysian-NSFW

Github Repo: https://github.com/malaysia-ai/sfw-classifier

Project Board: https://github.com/orgs/malaysia-ai/projects/6

Current Labels Available:

religion insult
sexist
racist
psychiatric or mental illness
harassment
safe for work
porn
self-harm

How to use

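# `classifier` is a local module, not part of transformers; it presumably ships with the GitHub repo above.
from classifier import MistralForSequenceClassification

model = MistralForSequenceClassification.from_pretrained('malaysia-ai/malaysian-sfw-classifier')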
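
Once the model is loaded, its raw scores still have to be turned into one of the labels listed above. The following is a minimal sketch of that step only, not the project's own inference code. It assumes the model returns one logit per label, and the `ID2LABEL` mapping is hypothetical: the real index order is not stated here and should be read from the checkpoint (for example `model.config.id2label`, if it is populated). The logits in the example are made up.

```python
import math

# Hypothetical index-to-label mapping, for illustration only.
# The real order must come from the checkpoint, not from this sketch.
ID2LABEL = {
    0: "racist",
    1: "religion insult",
    2: "psychiatric or mental illness",
    3: "sexist",
    4: "harassment",
    5: "porn",
    6: "safe for work",
    7: "self-harm",
}


def softmax(logits):
    """Convert raw scores to probabilities (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def decode_prediction(logits, id2label=ID2LABEL):
    """Return (label, probability) for the highest-scoring class."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return id2label[best], probs[best]


# Made-up logits standing in for one row of the model's output.
label, prob = decode_prediction([0.1, 0.2, 0.0, -1.0, 0.3, 4.0, 0.5, -0.2])
print(label, round(prob, 3))
```

Returning the probability alongside the label lets a caller apply a confidence threshold before acting on a prediction.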

Evaluation results:

                                precision    recall  f1-score   support

                       racist    0.88481   0.91264   0.89851      1717
              religion insult    0.86248   0.86753   0.86500      3246
psychiatric or mental illness    0.92863   0.83983   0.88200      5825
                       sexist    0.76152   0.74819   0.75480      1656
                   harassment    0.59621   0.86080   0.70448      1717
                         porn    0.96332   0.97697   0.97010      1129
                safe for work    0.90178   0.83741   0.86840      3881
                    self-harm    0.89489   0.92647   0.91040       340

                     accuracy                        0.85388     19511
                    macro avg    0.84920   0.87123   0.85671     19511
                 weighted avg    0.86641   0.85388   0.85709     19511
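
The summary rows can be recomputed from the per-class rows, which makes the difference between the two averages clear. The sketch below uses only the numbers in the report above. The macro average is the plain mean over the eight classes, while the weighted average scales each class by its support. The helper names are illustrative.

```python
# Per-class rows copied from the report above: (precision, recall, f1, support).
REPORT = {
    "racist": (0.88481, 0.91264, 0.89851, 1717),
    "religion insult": (0.86248, 0.86753, 0.86500, 3246),
    "psychiatric or mental illness": (0.92863, 0.83983, 0.88200, 5825),
    "sexist": (0.76152, 0.74819, 0.75480, 1656),
    "harassment": (0.59621, 0.86080, 0.70448, 1717),
    "porn": (0.96332, 0.97697, 0.97010, 1129),
    "safe for work": (0.90178, 0.83741, 0.86840, 3881),
    "self-harm": (0.89489, 0.92647, 0.91040, 340),
}

SUPPORT = 3  # index of the support column in each row


def macro_avg(rows, col):
    """Unweighted mean of one metric across classes."""
    return sum(r[col] for r in rows) / len(rows)


def weighted_avg(rows, col):
    """Mean of one metric across classes, weighted by class support."""
    total = sum(r[SUPPORT] for r in rows)
    return sum(r[col] * r[SUPPORT] for r in rows) / total


rows = list(REPORT.values())
for name, col in (("precision", 0), ("recall", 1), ("f1-score", 2)):
    print(f"{name:>9}  macro={macro_avg(rows, col):.5f}  "
          f"weighted={weighted_avg(rows, col):.5f}")
```

For a single-label classifier, support-weighted recall equals overall accuracy, which is why both appear as 0.85388 in the report.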