Upload 6 files

Browse files

Files changed (6) hide show

suicide/README.md +114 -0
suicide/config.json +31 -0
suicide/gitattributes +35 -0
suicide/special_tokens_map.json +7 -0
suicide/tokenizer.json +0 -0
suicide/vocab.txt +0 -0

suicide/README.md ADDED Viewed

	@@ -0,0 +1,114 @@

+---
+license: cc0-1.0
+language:
+- en
+metrics:
+- accuracy: 0.939432
+- recall: 0.937164
+- precision: 0.92822
+- f1: 0.92822
+tags:
+- classification
+- suicidality
+- suicidal text detection
+- suicidal sentiment
+- sentiment
+- suicide
+- self harm
+- depression
+pipeline_tag: text-classification
+---
+# Advanced Suicidality Classifier Model
+## Introduction
+Welcome to the Suicidality Detection AI Model! This project aims to provide a machine learning solution for detecting sequences of words indicative of suicidality in text. By utilizing the ELECTRA architecture and fine-tuning on a diverse dataset, we have created a powerful classification model that can distinguish between suicidal and non-suicidal text expressions.
+## Labels
+The model classifies input text into two labels:
+- `LABEL_0`: Indicates that the text is non-suicidal.
+- `LABEL_1`: Indicates that the text is indicative of suicidality.
+## Training
+The model was fine-tuned using the ELECTRA architecture on a carefully curated dataset. Our training process involved cleaning and preprocessing various text sources to create a comprehensive training set. The training results indicate promising performance, with metrics including:
+## Performance
+The model's performance on the validation dataset is as follows:
+- Accuracy: 0.939432
+- Recall: 0.937164
+- Precision: 0.92822
+- F1 Score: 0.932672
+These metrics demonstrate the model's ability to accurately classify sequences of text as either indicative of suicidality or non-suicidal.
+## Data Sources
+We collected data from multiple sources to create a rich and diverse training dataset:
+- https://www.kaggle.com/datasets/thedevastator/c-ssrs-labeled-suicidality-in-500-anonymized-red
+- https://www.kaggle.com/datasets/amangoyl/reddit-dataset-for-multi-task-nlp
+- https://www.kaggle.com/datasets/imeshsonu/suicideal-phrases
+- https://raw.githubusercontent.com/laxmimerit/twitter-suicidal-intention-dataset/master/twitter-suicidal_data.csv
+- https://www.kaggle.com/datasets/mohanedmashaly/suicide-notes
+- https://www.kaggle.com/datasets/natalialech/suicidal-ideation-on-twitter
+The data underwent thorough cleaning and preprocessing before being used for training the model.
+## How to Use
+### Installation
+To use the model, you need to install the Transformers library:
+```bash
+pip install transformers
+```
+### Using the Model
+You can utilize the model for text classification using the following code snippets:
+1. Using the pipeline approach:
+```python
+from transformers import pipeline
+classifier = pipeline("sentiment-analysis", model="sentinetyd/suicidality")
+result = classifier("text to classify")
+print(result)
+```
+2. Using the tokenizer and model programmatically:
+```python
+from transformers import AutoTokenizer, AutoModel
+tokenizer = AutoTokenizer.from_pretrained("sentinetyd/suicidality")
+model = AutoModel.from_pretrained("sentinetyd/suicidality")
+# Perform tokenization and prediction using the tokenizer and model
+```
+## Ethical Considerations
+Suicidality is a sensitive and serious topic. It's important to exercise caution and consider ethical implications when using this model. Predictions made by the model should be handled with care and used to complement human judgment and intervention.
+## Model Credits
+We would like to acknowledge the "gooohjy/suicidal-electra" model available on Hugging Face's model repository. You can find the model at [this link](https://huggingface.co/gooohjy/suicidal-electra). We used this model as a starting point and fine-tuned it to create our specialized suicidality detection model.
+## Contributions
+We welcome contributions and feedback from the community to further improve the model's performance, enhance the dataset, and ensure its responsible deployment.

suicide/config.json ADDED Viewed

	@@ -0,0 +1,31 @@

+{
+  "_name_or_path": "gooohjy/suicidal-electra",
+  "architectures": [
+    "ElectraForSequenceClassification"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "classifier_dropout": null,
+  "embedding_size": 768,
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "layer_norm_eps": 1e-12,
+  "max_position_embeddings": 512,
+  "model_type": "electra",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 0,
+  "position_embedding_type": "absolute",
+  "problem_type": "single_label_classification",
+  "summary_activation": "gelu",
+  "summary_last_dropout": 0.1,
+  "summary_type": "first",
+  "summary_use_proj": true,
+  "torch_dtype": "float32",
+  "transformers_version": "4.31.0",
+  "type_vocab_size": 2,
+  "use_cache": true,
+  "vocab_size": 30522
+}

suicide/gitattributes ADDED Viewed

	@@ -0,0 +1,35 @@

+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text

suicide/special_tokens_map.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}

suicide/tokenizer.json ADDED Viewed

The diff for this file is too large to render. See raw diff

suicide/vocab.txt ADDED Viewed

The diff for this file is too large to render. See raw diff