Update README.md
Browse files
README.md
CHANGED
@@ -3,6 +3,10 @@ license: apache-2.0
|
|
3 |
base_model: bert-base-uncased
|
4 |
tags:
|
5 |
- generated_from_trainer
|
|
|
|
|
|
|
|
|
6 |
metrics:
|
7 |
- accuracy
|
8 |
- precision
|
@@ -11,6 +15,10 @@ metrics:
|
|
11 |
model-index:
|
12 |
- name: bert-drug-review-to-condition
|
13 |
results: []
|
|
|
|
|
|
|
|
|
14 |
---
|
15 |
|
16 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -28,18 +36,23 @@ It achieves the following results on the evaluation set:
|
|
28 |
|
29 |
## Model description
|
30 |
|
31 |
-
|
32 |
|
33 |
## Intended uses & limitations
|
34 |
|
35 |
-
|
36 |
|
37 |
## Training and evaluation data
|
38 |
|
39 |
-
|
|
|
|
|
|
|
40 |
|
41 |
## Training procedure
|
42 |
|
|
|
|
|
43 |
### Training hyperparameters
|
44 |
|
45 |
The following hyperparameters were used during training:
|
@@ -65,4 +78,4 @@ The following hyperparameters were used during training:
|
|
65 |
- Transformers 4.40.0
|
66 |
- Pytorch 2.2.1+cu121
|
67 |
- Datasets 2.19.0
|
68 |
-
- Tokenizers 0.19.1
|
|
|
3 |
base_model: bert-base-uncased
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
+
- medical
|
7 |
+
- biology
|
8 |
+
- text-classification
|
9 |
+
- multiclass classification
|
10 |
metrics:
|
11 |
- accuracy
|
12 |
- precision
|
|
|
15 |
model-index:
|
16 |
- name: bert-drug-review-to-condition
|
17 |
results: []
|
18 |
+
datasets:
|
19 |
+
- Zakia/drugscom_reviews
|
20 |
+
language:
|
21 |
+
- en
|
22 |
---
|
23 |
|
24 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
36 |
|
37 |
## Model description
|
38 |
|
39 |
+
"bert-base-uncased" fine-tuned for text-classification (multiclass): from input text, the model outputs the most likely medical pathology of the person. Training based on predicting 'condition' feature from 'review' feature (i.e., the person reviews the drugs they are taking for their condition)
|
40 |
|
41 |
## Intended uses & limitations
|
42 |
|
43 |
+
Personal project
|
44 |
|
45 |
## Training and evaluation data
|
46 |
|
47 |
+
The 100 most frequent conditions of the dataset are selected:
|
48 |
+
{0: 'multiple sclerosis', 1: 'overactive bladde', 2: 'hyperhidrosis', 3: 'ibromyalgia', 4: 'menstrual disorders', 5: 'hypogonadism, male', 6: 'rosacea', 7: 'muscle spasm', 8: 'high blood pressure', 9: 'epilepsy', 10: 'psoriatic arthritis', 11: 'post traumatic stress disorde', 12: 'smoking cessation', 13: 'not listed / othe', 14: 'herpes simplex', 15: 'opiate dependence', 16: 'social anxiety disorde', 17: 'urticaria', 18: 'allergic rhinitis', 19: 'polycystic ovary syndrome', 20: 'obsessive compulsive disorde', 21: 'depression', 22: 'migraine prevention', 23: 'neuropathic pain', 24: 'ankylosing spondylitis', 25: 'skin or soft tissue infection', 26: 'constipation, drug induced', 27: 'obesity', 28: 'vaginal yeast infection', 29: 'osteoarthritis', 30: 'restless legs syndrome', 31: 'plaque psoriasis', 32: 'panic disorde', 33: 'abnormal uterine bleeding', 34: 'adhd', 35: 'high cholesterol', 36: 'diabetes, type 2', 37: 'anxiety and stress', 38: 'asthma, maintenance', 39: 'pneumonia', 40: 'schizophrenia', 41: 'opiate withdrawal', 42: 'osteoporosis', 43: 'influenza', 44: 'weight loss', 45: 'cough and nasal congestion', 46: 'birth control', 47: 'benign prostatic hyperplasia', 48: 'helicobacter pylori infection', 49: 'anxiety', 50: 'bronchitis', 51: 'rheumatoid arthritis', 52: 'narcolepsy', 53: 'generalized anxiety disorde', 54: 'insomnia', 55: 'nasal congestion', 56: 'major depressive disorde', 57: 'schizoaffective disorde', 58: 'psoriasis', 59: 'premenstrual dysphoric disorde', 60: 'bacterial vaginitis', 61: 'motion sickness', 62: 'erectile dysfunction', 63: 'constipation, chronic', 64: 'copd, maintenance', 65: 'back pain', 66: 'alcohol dependence', 67: 'migraine', 68: 'bladder infection', 69: 'underactive thyroid', 70: 'ulcerative colitis', 71: 'chronic pain', 72: 'hiv infection', 73: 'cold sores', 74: 'breast cance', 75: 'bipolar disorde', 76: 'irritable bowel syndrome', 77: 'anesthesia', 78: 'onychomycosis, toenail', 79: 'chlamydia infection', 80: 'gerd', 81: 'endometriosis', 82: 'seizures', 83: 'alcohol withdrawal', 84: 'bowel preparation', 85: 'hot flashes', 86: 'bacterial infection', 87: 'inflammatory conditions', 88: 'constipation', 89: 'headache', 90: 'urinary tract infection', 91: 'sinusitis', 92: 'emergency contraception', 93: 'cough', 94: 'acne', 95: 'atrial fibrillation', 96: 'pain', 97: 'nausea/vomiting', 98: 'hepatitis c', 99: 'postmenopausal symptoms'}
|
49 |
+
The 'review' feature is lowercased and are only selected examples with more than 16 characters.
|
50 |
+
|
51 |
|
52 |
## Training procedure
|
53 |
|
54 |
+
See code available at: https://github.com/mlafuentem/Marcuswas-bert-drug-review-to-condition/blob/main/Exercise_classification_conditions_code.ipynb
|
55 |
+
|
56 |
### Training hyperparameters
|
57 |
|
58 |
The following hyperparameters were used during training:
|
|
|
78 |
- Transformers 4.40.0
|
79 |
- Pytorch 2.2.1+cu121
|
80 |
- Datasets 2.19.0
|
81 |
+
- Tokenizers 0.19.1
|