metadata
tags:
- generated_from_keras_callback
model-index:
- name: industry-classification
results: []
license: apache-2.0
inference: false
language:
- en
- de
industry-classification
The Industry Classification Model classifies German or English content into one of 20 industries. It is based on DistilBERT and was fine-tuned on a diverse dataset of approximately 50,000 records.
Supported Industries
The model can classify content into the following industries:
- Accommodation Services
- Administrative and Support Services
- Consumer Services
- Education
- Entertainment Providers
- Farming, Ranching, Forestry
- Financial Services
- Government Administration
- Holding Companies
- Hospitals and Health Care
- Manufacturing
- Oil, Gas, and Mining
- Professional Services
- Real Estate and Equipment Rental Services
- Retail
- Technology, Information and Media
- Transportation, Logistics and Storage
- Utilities
- Wholesale
Usage
from transformers import DistilBertTokenizer, TFDistilBertForSequenceClassification, pipeline

# Load the fine-tuned tokenizer and TensorFlow model from the Hugging Face Hub
tokenizer = DistilBertTokenizer.from_pretrained("swarupt/industry-classification")
model = TFDistilBertForSequenceClassification.from_pretrained("swarupt/industry-classification")

# "text-classification" is the canonical task name; "sentiment-analysis" is an alias for it
classifier = pipeline("text-classification", model=model, tokenizer=tokenizer)
classifier("Consumers enjoy PepsiCo products more than one billion times a day in more than 200 countries and territories. In 2023, PepsiCo generated more than $91 billion in net revenue, driven by a complementary beverage and convenient foods portfolio that includes Lay’s, Doritos, Cheetos, Gatorade, Pepsi-Cola, Mountain Dew, Quaker and SodaStream.")
'''Output'''
[{'label': 'Manufacturing', 'score': 0.9472715854644775}]
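The result is a list of dicts, each with a `label` and a `score`. When classifying many texts, it can help to discard low-confidence predictions. The following is a minimal sketch of such a post-processing step, not part of the model or of the `transformers` API: the helper names `confident_label` and `label_texts` and the cutoff `MIN_SCORE = 0.6` are hypothetical and chosen only for illustration. It assumes one prediction dict per input text, in the shape shown above.

```python
from typing import Optional

# Hypothetical cutoff for illustration; not a value recommended by the model authors.
MIN_SCORE = 0.6


def confident_label(prediction: dict, min_score: float = MIN_SCORE) -> Optional[str]:
    """Return the predicted industry, or None if its score is below min_score."""
    if prediction["score"] >= min_score:
        return prediction["label"]
    return None


def label_texts(texts: list, predictions: list, min_score: float = MIN_SCORE) -> dict:
    """Map each input text to its confident label, or to None."""
    if len(texts) != len(predictions):
        raise ValueError("texts and predictions must have the same length")
    return {
        text: confident_label(pred, min_score)
        for text, pred in zip(texts, predictions)
    }


# Predictions in the shape shown above: one dict per input text.
# The first entry is the documented output; the second is a made-up
# low-confidence result used only to exercise the cutoff.
predictions = [
    {"label": "Manufacturing", "score": 0.9472715854644775},
    {"label": "Retail", "score": 0.41},
]
texts = ["PepsiCo company description", "Ambiguous short text"]
print(label_texts(texts, predictions))
```

A suitable cutoff depends on the application and would need to be tuned on held-out data; texts that map to `None` can be routed to manual review.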
Supported Languages
- English
- German / Deutsch