Wojood - Nested/Flat Arabic NER Models

Wojood is a corpus for Arabic nested Named Entity Recognition (NER). Nested entities occur when one entity mention is embedded inside another entity mention. The corpus contains about 550K tokens (MSA and dialect). This repo contains the source code to train the Wojood nested NER models.
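To make the idea of nesting concrete, here is a minimal sketch of how nested mentions can be represented as token spans. The `Span` type, the helper functions, the example phrase, and the labels `ORG` and `GPE` are illustrative assumptions, not the data format or API used in this repo.

```python
from typing import List, NamedTuple, Tuple


class Span(NamedTuple):
    start: int   # index of the first token (inclusive)
    end: int     # index after the last token (exclusive)
    label: str   # entity type (illustrative label, not read from the corpus)


def is_nested(inner: Span, outer: Span) -> bool:
    """Return True if `inner` lies inside `outer` and has different boundaries."""
    same_boundaries = (inner.start, inner.end) == (outer.start, outer.end)
    return (not same_boundaries
            and outer.start <= inner.start
            and inner.end <= outer.end)


def nested_pairs(spans: List[Span]) -> List[Tuple[Span, Span]]:
    """List every (inner, outer) pair where one mention is embedded in another."""
    return [(i, o) for i in spans for o in spans if is_nested(i, o)]


# Hypothetical example: "Birzeit University" is an organization whose
# second token is itself a geopolitical entity.
tokens = ["جامعة", "بيرزيت"]
spans = [Span(0, 2, "ORG"), Span(1, 2, "GPE")]
pairs = nested_pairs(spans)
```

A flat NER scheme would keep only one label per token (typically the outer `ORG` mention here), while a nested scheme keeps both spans.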

Online Demo

You can try our model using the demo link below.

  • Nested NER (main branch), with a micro-F1 score of 0.909551
  • Flat NER (flat branch), with a micro-F1 score of 0.883847
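For readers unfamiliar with the metric, micro-F1 pools true positives, predicted mentions, and gold mentions across all entity types before computing precision and recall. The sketch below shows that computation on exact-match spans; the function name `micro_f1`, the `(start, end, label)` tuple format, and the sample data are assumptions for illustration and are not necessarily how this repo's evaluation script works.

```python
from typing import Iterable, Tuple

Mention = Tuple[int, int, str]  # (start, end, label), an assumed format


def micro_f1(gold: Iterable[Mention], pred: Iterable[Mention]) -> float:
    """Micro-averaged F1 over exact-match entity mentions of all types."""
    gold_set, pred_set = set(gold), set(pred)
    true_positives = len(gold_set & pred_set)
    precision = true_positives / len(pred_set) if pred_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Made-up mentions: two of three predictions match the gold spans exactly;
# the third has the right label but the wrong end boundary.
gold = [(0, 2, "ORG"), (1, 2, "GPE"), (4, 5, "PERS")]
pred = [(0, 2, "ORG"), (1, 2, "GPE"), (4, 6, "PERS")]
score = micro_f1(gold, pred)
```

Because counts are pooled before averaging, frequent entity types influence micro-F1 more than rare ones.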

Google Colab Notebooks

You can test our model using our Google Colab notebooks.

