mentBERT / README.md
reab5555's picture
Update README.md
b4d3c90 verified
---
license: apache-2.0
tags:
- generated_from_trainer
metrics:
- f1
- auc
model-index:
- name: pretrained_model
results:
- task:
name: Text Classification
type: text-classification
metrics:
- name: F1
type: f1
value: 0.6797
- name: AUC
type: auc
value: 0.7942
widget:
- text: "I have trouble understanding what other people think or feel. I also like numbers, and finding patterns in numbers."
---
This model is a hybrid fine-tuned version of distilbert-base-uncased on Reddit dataset contains text related to mental health reports of users. it predicts mental health disorders from textual content.
It achieves the following results on the validation set:
* Loss: 0.1873
* F1: 0.6797
* AUC: 0.7942
* Precision: 0.7731
# Description
This model is a finetuned BERT (bert-base-uncased) model that predict different mental disorders.
* It is trained on a costume dataset of texts or posts (from Reddit) about general experiences of users with mental health problems.
* Dataset was cleaned and all direct mentions of the disorder names in the texts were removed.
It includes the following classes:
* Borderline
* Anxiety
* Depression
* Bipolar
* OCD
* ADHD
* Schizophrenia
* Asperger
* PTSD
# Training
Train size: 90%
Val size: 10%
Training set class counts (text samples) after balancing:
Borderline: 10398
Anxiety: 10393
Depression: 10400
Bipolar: 10359
OCD: 10413
ADHD: 10412
Schizophrenia: 10447
Asperger: 10470
PTSD: 10489
Validation set class counts after balancing:
Borderline: 1180
Anxiety: 1185
Depression: 1178
Bipolar: 1219
OCD: 1165
ADHD: 1166
Schizophrenia: 1131
Asperger: 1108
PTSD: 1089
model-finetuning: bert-base-uncased
The following hyperparameters were used during training:
learning_rate: 5e-05
train_batch_size: 32
val_batch_size: 32
optimizer: AdamW
num_epochs: 2-3
# Training results
| Epoch | Training Loss | Validation Loss |
|-------|---------------|-----------------|
| 1.0 | 0.2089 | 0.1771 |
| 2.0 | 0.1525 | 0.1716 |
F1 Score: 0.6797
AUC Score: 0.7942
## Classification Report
Borderline:
Precision: 0.6682
Recall: 0.5923
F1-score: 0.6280
Anxiety:
Precision: 0.6620
Recall: 0.6497
F1-score: 0.6558
Depression:
Precision: 0.7261
Recall: 0.5424
F1-score: 0.6209
Bipolar:
Precision: 0.8055
Recall: 0.5233
F1-score: 0.6345
OCD:
Precision: 0.8200
Recall: 0.6532
F1-score: 0.7271
ADHD:
Precision: 0.8740
Recall: 0.6603
F1-score: 0.7523
Schizophrenia:
Precision: 0.8017
Recall: 0.6472
F1-score: 0.7162
Asperger:
Precision: 0.7368
Recall: 0.6570
F1-score: 0.6946
PTSD:
Precision: 0.8612
Recall: 0.5812
F1-score: 0.6940