astarostap/distilbert-cased-antisemitic-tweets

This model takes a tweet with the word "jew" in it, and determines if it's antisemitic.

Training data:

This model was trained on 4k tweets, where ~50% were labeled as antisemitic.

I labeled them myself based on personal experience and knowledge about common antisemitic tropes.

Note:

The goal for this model is not to be used as a final say on what is or is not antisemitic, but rather as a first pass on what might be antisemitic and should be reviewed by human experts.

Please keep in mind that I'm not an expert on antisemitism or hatespeech.

Whether something is antisemitic or not depends on the context, as for any hate speech, and everyone has a different definition for what is hate speech.

If you would like to collaborate on antisemitism detection, please feel free to contact me at starosta@alumni.stanford.edu

This model is not ready for production, it needs more evaluation and more training data.