deid_bert_i2b2 / README.md
metadata
language:
  - en
thumbnail: https://www.onebraveidea.org/wp-content/uploads/2019/07/OBI-Logo-Website.png
tags:
  - deidentification
  - medical notes
datasets:
  - I2B2
metrics:
  - F1
  - Recall
  - AUC
inference:
  parameters:
    aggregation_strategy: first
widget:
  - text: >-
      Physician Discharge Summary Admit date: 10/12/1982 Discharge date:
      10/22/1982 Patient Information Jack Smith, 54 y.o. male (DOB = 1/21/1928).
  - text: >-
      Home Address: 123 Park Drive, San Diego, CA, 03245. Home Phone:
      202-555-0199 (home).
  - text: >-
      Hospital Care Team Service: Orthopedics Inpatient Attending: Roger C
      Kelly, MD Attending phys phone: (634)743-5135 Discharge Unit: HCS843
      Primary Care Physician: Hassan V Kim, MD 512-832-5025.

BERT-based deidentification model

This repo contains model weights for clinical note deidentification, trained on the I2B2 dataset. Note that the hosted inference API uses a different tokenizer from the one we developed for this task, so its predictions may differ from those of the full pipeline. Please see OBI EHR deidentification for more details and for instructions on getting started.
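As a rough illustration of how predicted spans can be used, the sketch below replaces each span with a placeholder tag. It assumes the entity format that the `transformers` token-classification pipeline returns when an `aggregation_strategy` such as `first` is set (dicts with `entity_group`, `start`, and `end`). The `redact` helper, the label names, and the hand-written spans are hypothetical and are not output from this model; the model id in the comment is also an assumption.

```python
from typing import Dict, List


def redact(text: str, entities: List[Dict]) -> str:
    """Replace each predicted character span with a <<LABEL>> placeholder."""
    # Process spans right to left so earlier character offsets stay valid.
    for ent in sorted(entities, key=lambda e: e["start"], reverse=True):
        placeholder = f"<<{ent['entity_group']}>>"
        text = text[: ent["start"]] + placeholder + text[ent["end"]:]
    return text


# In practice the spans would come from something like (model id assumed):
#   from transformers import pipeline
#   ner = pipeline("token-classification", model="obi/deid_bert_i2b2",
#                  aggregation_strategy="first")
#   entities = ner(text)
# Here we use hand-written spans with illustrative labels instead.
text = "Jack Smith, 54 y.o. male (DOB = 1/21/1928)."
name, dob = "Jack Smith", "1/21/1928"
entities = [
    {"entity_group": "PATIENT", "start": text.index(name),
     "end": text.index(name) + len(name)},
    {"entity_group": "DATE", "start": text.index(dob),
     "end": text.index(dob) + len(dob)},
]

redacted = redact(text, entities)
print(redacted)
```

Because the hosted inference API uses a different tokenizer, spans obtained this way may not match those produced by the OBI EHR deidentification code.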