language: | |
- en | |
thumbnail: "https://www.onebraveidea.org/wp-content/uploads/2019/07/OBI-Logo-Website.png" | |
tags: | |
- deidentification | |
- medical notes | |
datasets: | |
- I2B2 | |
metrics: | |
- F1 | |
- Recall | |
- AUC | |
inference: | |
parameters: | |
aggregation_strategy: "first" | |
widget: | |
- text: "Physician Discharge Summary Admit date: 10/12/1982 Discharge date: 10/22/1982 Patient Information Jack Smith, 54 y.o. male (DOB = 1/21/1928)." | |
- text: "Home Address: 123 Park Drive, San Diego, CA, 03245. Home Phone: 202-555-0199 (home)." | |
- text: "Hospital Care Team Service: Orthopedics Inpatient Attending: Roger C Kelly, MD Attending phys phone: (634)743-5135 Discharge Unit: HCS843 Primary Care Physician: Hassan V Kim, MD 512-832-5025." | |
# BERT-based deidentification model | |
This repo contains model weights for clinical note de-deidentification trained on the I2B2 dataset. Note that the hosted inference API uses a different tokenizer than what we developed for this task. | |
Please see [OBI EHR deidentification](https://github.com/obi-ds/ehr_deidentification) for more details and how to get started. | |