xeroISB
/

incidentImpactModel

Text Classification

Inference Endpoints

Model card Files Files and versions Community

incidentImpactModel / README.md

xeroISB's picture

Update README.md

d5e8202 verified 12 days ago

|

history blame contribute delete

No virus

3.27 kB

	---
	license: apache-2.0
	---


	---

	# Incident Impact Classification Model

	## Model Description

	This model is a fine-tuned version of BERT (bert-base-uncased) designed to classify the impact of incident records based on their short descriptions. The impact is categorized into three levels: low, medium, and high.

	## Intended Use

	Only for Demo pupose - The model is intended to assist in the automatic categorization of incident impacts to streamline incident management processes. It can be used by IT service management teams to quickly identify the severity of incidents based on short descriptions.

	## How to Use

	### Inference

	To use the model for inference, you can utilize the Hugging Face `transformers` library. Below is an example of how to load the model and make predictions:

	```python
	from transformers import pipeline

	model_name = "xeroISB/incidentImpactModel"
	classifier = pipeline("text-classification", model=model_name)

	short_description = "Network outage in building 12"
	prediction = classifier(short_description)
	print(prediction)
	```

	### Training

	The model was trained using the following configuration:
	- Model: BERT (bert-base-uncased)
	- Learning Rate: 2e-5
	- Batch Size: 16
	- Epochs: 3
	- Evaluation Strategy: Epoch
	- Optimizer: AdamW
	- Loss Function: Cross-Entropy Loss

	### Dataset

	The dataset used for training includes the following columns:
	- `short_description`: A brief description of the incident.
	- `impact`: The impact level categorized into three classes: low (3), medium (2), and high (1).

	The `impact` values were mapped to integer labels as follows:
	- 3 (low) -> 0
	- 2 (medium) -> 1
	- 1 (high) -> 2

	### Tokenization

	The `short_description` was tokenized using the BERT tokenizer with padding to the maximum length and truncation enabled.

	## Performance

	### Confusion Matrix

	The confusion matrix on the validation set is as follows:

	\| \| Predicted Low \| Predicted Medium \| Predicted High \|
	\|------\|---------------\|------------------\|----------------\|
	\| Low \| 414 \| 194 \| 0 \|
	\| Medium \| 463 \| 220 \| 0 \|
	\| High \| 33 \| 13 \| 0 \|

	### Classification Report

	```
	Precision - 47%
	Accuracy - 47%
	Recall - 47%
	F1 - 45%

	```

	## Limitations

	- The model's performance is dependent on the quality and representativeness of the training data.
	- It may not perform well on unseen incident descriptions that are significantly different from the training data.
	- The model's predictions are limited to the context of the provided short descriptions and do not take into account other contextual information.

	## Ethical Considerations

	- Ensure the model is used in an ethical manner, considering the potential impact of misclassifications on incident management and prioritization.
	- Regularly monitor the model's performance and update it with new data to maintain accuracy and reliability.

	## License

	This model is released under the [Apache 2.0 License](https://opensource.org/licenses/Apache2.0).

	## Contact Information

	For questions or issues, please contact [Tushar Mishra](mailto:tushar_mishra2023@ampba.isb.edu).

	---