license: cc-by-nc-nd-4.0
This model is a multi-class classifier fine-tuned from 'bert-base-uncased'.
It was trained on a large corpus of Twitter users' metadata.
It sorts users into three main categories: Non-ExpertUser, ExpertUser, and Other. The aim of this project was to determine whether a tweet comes from an individual and, if so, whether that person is an expert in the field of security and privacy.
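
As a rough illustration of how a three-way prediction could be read off, the sketch below turns a vector of raw scores (logits) into a class name and a confidence. The class indices follow the validation table in this card; whether the published checkpoint's configuration uses the same index-to-name mapping is an assumption, and the `softmax` and `decode` helpers and the example logits are hypothetical.

```python
import math

# Class indices as printed in the validation table of this card.
# Assumption: the checkpoint's own id-to-label mapping matches this order.
ID2LABEL = {0: "ExpertUser", 1: "Non-ExpertUser", 2: "Other"}


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    shift = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - shift) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def decode(logits):
    """Return (class name, probability) for the highest-scoring class."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return ID2LABEL[best], probs[best]


# Made-up logits, purely for illustration.
label, confidence = decode([0.1, 2.0, -1.0])
print(label, round(confidence, 3))
```

In practice the logits would come from the sequence-classification head of the fine-tuned model; that step is omitted here because it depends on the model repository, which this card does not name.
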
Originally, the model had 4 classes: the 'Other' category was split into 'Non-Person' (denoting accounts such as organizations) and 'Unknown'.
Since the main aim was to find out whether a user is a non-expert user or not, the number of classes was reduced to 3 in this version 2.
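
The reduction from four classes to three can be pictured as a simple label mapping. The sketch below is only an illustration: the class names are taken from the text above, but the `V1_TO_V2` table and the `collapse_labels` helper are hypothetical, and the card does not say how the authors actually carried out the merge (for example, relabelling the training data versus merging predictions afterwards).

```python
# Hypothetical mapping from the original 4-class scheme to the 3-class
# scheme of version 2: 'Non-Person' and 'Unknown' both fold into 'Other'.
V1_TO_V2 = {
    "ExpertUser": "ExpertUser",
    "Non-ExpertUser": "Non-ExpertUser",
    "Non-Person": "Other",
    "Unknown": "Other",
}


def collapse_labels(labels):
    """Map a list of 4-class labels onto the 3-class scheme."""
    return [V1_TO_V2[label] for label in labels]


# Made-up labels, purely for illustration.
print(collapse_labels(["Non-Person", "ExpertUser", "Unknown", "Non-ExpertUser"]))
```
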
The validation scores for the model were as follows:
Accuracy = 0.93

| Class | Precision | Recall | F1-Score |
|---|---|---|---|
| ExpertUser (0) | 0.88 | 0.90 | 0.89 |
| Non-ExpertUser (1) | 0.95 | 0.97 | 0.96 |
| Other (2) | 0.85 | 0.78 | 0.81 |
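
For readers who want to see how the columns relate, the F1-score is the harmonic mean of precision and recall, F1 = 2PR / (P + R). The sketch below recomputes it from the rounded precision and recall values in the table and compares the result with the reported F1. The `REPORTED` dictionary and the `f1` helper are illustrative names, and because the inputs are already rounded to two decimals, small discrepancies would not necessarily indicate an error.

```python
# (precision, recall, reported F1) per class, copied from the table above.
REPORTED = {
    "ExpertUser": (0.88, 0.90, 0.89),
    "Non-ExpertUser": (0.95, 0.97, 0.96),
    "Other": (0.85, 0.78, 0.81),
}


def f1(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


for name, (p, r, reported) in REPORTED.items():
    print(f"{name}: recomputed F1 = {f1(p, r):.2f}, reported = {reported:.2f}")
```
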
Paper: The design of the model is described in the paper "Perspectives of non-expert users on cyber security and privacy: An analysis of online discussions on Twitter".
Please cite the paper if you use this model:
Nandita Pattnaik, Shujun Li, and Jason R.C. Nurse. 2023.
Perspectives of non-expert users on cyber security and
privacy: An analysis of online discussions on Twitter.
Computers & Security 125 (2023), 103008. https://doi.org/10.1016/j.cose.2022.103008