deid_bert_i2b2 / README.md
mhomilius's picture
updated README
da3b4c6
# BERT-based deidentification model
This repo contains model weights trained on the I2B2 dataset. Note that the hosted inference API uses a different tokenizer than what we developed for this task.
Please see [OBI EHR deidentification](https://github.com/obi-ds/ehr_deidentification) for more details and how to get started.
---
language:
- en
thumbnail: "https://www.onebraveidea.org/wp-content/uploads/2019/07/OBI-Logo-Website.png"
tags:
- deidentification
- medical notes
license: "any valid license identifier"
datasets:
- I2B2
metrics:
- F1
- Recall
- AUC
widget:
- text: "Physician Discharge Summary Admit date: 10/12/1982 Discharge date: 10/22/1982 Patient Information Jack Smith, 54 y.o. male (DOB = 1/21/1928)."
- text: "Home Address: 123 Park Drive, San Diego, CA, 03245. Home Phone: 202-555-0199 (home)."
- text: "Hospital Care Team Service: Orthopedics Inpatient Attending: Roger C Kelly, MD Attending phys phone: (634)743-5135 Discharge Unit: HCS843 Primary Care Physician: Hassan V Kim, MD 512-832-5025."
---