samant commited on
Commit
7fdcf5d
·
verified ·
1 Parent(s): 59c2c34

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +75 -0
README.md ADDED
@@ -0,0 +1,75 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ metrics:
5
+ - precision
6
+ - recall
7
+ - f1-score
8
+ - support
9
+ base_model:
10
+ - google-bert/bert-base-uncased
11
+ tags:
12
+ - medical
13
+ - NER
14
+ - healthcare
15
+ ---
16
+ # Model Card for Medical NER Model
17
+
18
+ This model is a high-performance Named Entity Recognition (NER) model designed specifically for medical text. It identifies entities such as diseases, symptoms, procedures, medications, and healthcare providers with high precision and recall, making it ideal for clinical and healthcare applications.
19
+
20
+
21
+ ### Model Description
22
+
23
+
24
+ - **Model type:** Named Entity Recognition (NER)
25
+ - **Language(s) (NLP):** English (en)
26
+ - **Finetuned from model :** google-bert/bert-base-uncased
27
+
28
+ This model has been fine-tuned on a medical dataset to achieve high accuracy in extracting key entities from healthcare documents.
29
+
30
+ ## Uses
31
+
32
+ ### Direct Use
33
+
34
+ This model is intended for extracting medical entities from clinical or healthcare-related text. It can be used for:
35
+ - Automating medical document analysis.
36
+ - Structuring unstructured healthcare data.
37
+ - Supporting healthcare AI applications.
38
+
39
+ ### Downstream Use
40
+
41
+ The model can be further fine-tuned for:
42
+ - Specific medical subdomains.
43
+ - Entity classification in multi-lingual healthcare datasets.
44
+
45
+ ### Out-of-Scope Use
46
+
47
+ This model is not designed for:
48
+ - General NER tasks outside the healthcare domain.
49
+ - Identifying non-medical entities or handling multi-lingual text without fine-tuning.
50
+
51
+ ## Bias, Risks, and Limitations
52
+
53
+ ### Risks:
54
+ - Misclassification of entities in ambiguous or highly noisy text.
55
+ - Potential underperformance on unseen medical terms or rare conditions.
56
+
57
+ ### Limitations:
58
+ - The model is optimized for English and may not perform well on other languages.
59
+ - It requires clean text inputs with minimal OCR errors for best performance.
60
+
61
+ ### Recommendations
62
+ Users should validate extracted entities before use in critical applications, such as medical decision-making.
63
+
64
+ ## How to Get Started with the Model
65
+
66
+ ```python
67
+ from transformers import pipeline
68
+
69
+ # Load the model
70
+ ner_pipeline = pipeline("ner", model="samant/medical-ner")
71
+
72
+ # Example usage
73
+ text = "The patient has been diagnosed with Type 2 Diabetes and prescribed Metformin."
74
+ entities = ner_pipeline(text)
75
+ print(entities)