# BERT-based deidentification model This repo contains model weights trained on the I2B2 dataset. Note that the hosted inference API uses a different tokenizer than what we developed for this task. Please see [OBI EHR deidentification](https://github.com/obi-ds/ehr_deidentification) for more details and how to get started. --- language: - en thumbnail: "https://www.onebraveidea.org/wp-content/uploads/2019/07/OBI-Logo-Website.png" tags: - deidentification - medical notes license: "any valid license identifier" datasets: - I2B2 metrics: - F1 - Recall - AUC widget: - text: "Physician Discharge Summary Admit date: 10/12/1982 Discharge date: 10/22/1982 Patient Information Jack Smith, 54 y.o. male (DOB = 1/21/1928)." - text: "Home Address: 123 Park Drive, San Diego, CA, 03245. Home Phone: 202-555-0199 (home)." - text: "Hospital Care Team Service: Orthopedics Inpatient Attending: Roger C Kelly, MD Attending phys phone: (634)743-5135 Discharge Unit: HCS843 Primary Care Physician: Hassan V Kim, MD 512-832-5025." ---