metadata
license: apache-2.0
datasets:
- AyoubChLin/CNN_News_Articles_2011-2022
language:
- en
metrics:
- accuracy
pipeline_tag: zero-shot-classification
DistilBERT for Zero Shot Classification
This repository contains a DistilBERT model trained on CNN news articles for zero-shot classification. Evaluated on CNN articles, the model achieves an accuracy of 0.956 and an F1 score of 0.955.
Model Details
- Architecture: DistilBERT
- Training Data: CNN articles (dataset: AyoubChLin/CNN_News_Articles_2011-2022)
- Accuracy: 0.956
- F1 Score: 0.955
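For readers who want to reproduce metrics of this kind on their own labelled data, the following is a minimal sketch of how accuracy and F1 can be computed from predicted and true labels. It assumes a macro-averaged F1 (the unweighted mean of per-class F1 scores); the card does not state which averaging was used for the reported figure. The function names and the small label lists are hypothetical and serve only as illustration.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumption: macro averaging)."""
    labels = sorted(set(y_true) | set(y_pred))
    per_class = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs
        per_class.append(2 * tp / denom if denom else 0.0)
    return sum(per_class) / len(per_class)


# Hypothetical labels, hand-written for illustration (not real model output)
y_true = ["sports", "sports", "politics", "business"]
y_pred = ["sports", "politics", "politics", "business"]

print(accuracy(y_true, y_pred))
print(macro_f1(y_true, y_pred))
```

In practice the same numbers can be obtained from an evaluation library; the hand-written version is shown only to make the definitions explicit.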
Usage
To use this model for zero-shot classification, you can follow the steps below:
Load the trained model:

    from transformers import AutoTokenizer, AutoModelForSequenceClassification

    tokenizer = AutoTokenizer.from_pretrained("AyoubChLin/DistilBERT_ZeroShot")
    model = AutoModelForSequenceClassification.from_pretrained("AyoubChLin/DistilBERT_ZeroShot")
Classify text using zero-shot classification:

    from transformers import pipeline

    # Create a zero-shot classification pipeline
    classifier = pipeline("zero-shot-classification", model=model, tokenizer=tokenizer)

    # Classify a sentence
    sentence = "The latest scientific breakthroughs in medicine"
    candidate_labels = ["politics", "sports", "technology", "business"]
    result = classifier(sentence, candidate_labels)
    print(result)
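The output is a dictionary with three keys: `sequence` (the input text), `labels` (the candidate labels, sorted from most to least likely), and `scores` (the classification score for each label, in the same order).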
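As a minimal sketch of how the returned dictionary might be consumed, the helpers below pick the best label and filter labels by a score threshold. The helper names `top_label` and `labels_above`, the threshold value, and the example dictionary are hypothetical; the scores are hand-written for illustration and are not real output from this model.

```python
def top_label(result):
    """Return the (label, score) pair with the highest score."""
    # Pair labels with scores explicitly rather than relying on list order.
    return max(zip(result["labels"], result["scores"]), key=lambda pair: pair[1])


def labels_above(result, threshold=0.5):
    """Return every label whose score is at least `threshold`."""
    return [
        label
        for label, score in zip(result["labels"], result["scores"])
        if score >= threshold
    ]


# Hypothetical result in the pipeline's output format (illustrative values only)
example_result = {
    "sequence": "The latest scientific breakthroughs in medicine",
    "labels": ["technology", "business", "politics", "sports"],
    "scores": [0.7, 0.15, 0.1, 0.05],
}

label, score = top_label(example_result)
print(label, score)
print(labels_above(example_result, threshold=0.1))
```

By default the scores are normalized across the candidate labels and sum to 1. If a text may belong to several categories at once, the pipeline can be called as `classifier(sentence, candidate_labels, multi_label=True)`, which scores each label independently; a threshold filter such as `labels_above` is most useful in that mode.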
About the Author
This work was created by Ayoub Cherguelaine.
If you have any questions or suggestions regarding this repository or the trained model, feel free to reach out to Ayoub Cherguelaine.