NanditaP/PrivSec-Classifier

This is a binary classification model fine-tuned using the model 'bert-base-uncased'. It is built using a large Twitter dataset and is suitable especially for Twitter style data.

This can be used to classify the text into the categories of 'Privacy & Security' or 'Non-Privacy and Security'.

It achieved the following results on the evaluation set:

The validation scores for the module were as follows

Accuracy = 0.92

Class	Precision	Recall	F1-Score
PrivSec(0)	0.91	0.94	0.92
Non-PrivSec(1)	0.93	0.89	0.91

Paper: The paper detailing how it was designed can be found here Perspectives of non-expert users on cyber security and privacy: An analysis of online discussions on twitter

Please cite the paper if you use this model :

Nandita Pattnaik, Shujun Li, and Jason R.C. Nurse. 2023.
Perspectives of non-expert users on cyber security and privacy: An analysis of online discussions on Twitter.
Computers & Security 125 (2023), 103008. https://doi.org/10.1016/j.cose.2022.103008