language: | |
- en | |
pipeline_tag: token-classification | |
tags: | |
- medical | |
Protected health information (PHI) anonymization tool. Fine-tuned on the [i2b2 2014 training dataset](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4989908/) from the pretrained `bert-base-cased` model. | |
Anonymizes according to the i2b2 2014 standard, including all ages, locations and organizations, dates (including lone years), names, professions, identification numbers, and contact information. | |
Model released with the approval of Informatics for Integrating Biology & the Bedside. |