--- language: - en pipeline_tag: token-classification tags: - medical --- Protected health information (PHI) anonymization tool. Fine-tuned on the [i2b2 2014 training dataset](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4989908/) from the pretrained `bert-base-cased` model. Anonymizes according to the i2b2 2014 standard, including all ages, locations and organizations, dates (including lone years), names, professions, identification numbers, and contact information. Model released with the approval of Informatics for Integrating Biology & the Bedside.