industry-classification

The Industry Classification Model classifies German or English content into one of 20 industries. It is based on DistilBERT and was fine-tuned for industry classification on a diverse dataset of approximately 50,000 records.

Supported Industries

The model can classify content into the following industries:

  1. Accommodation Services
  2. Administrative and Support Services
  3. Consumer Services
  4. Education
  5. Entertainment Providers
  6. Farming, Ranching, Forestry
  7. Financial Services
  8. Government Administration
  9. Holding Companies
  10. Hospitals and Health Care
  11. Manufacturing
  12. Oil, Gas, and Mining
  13. Professional Services
  14. Real Estate and Equipment Rental Services
  15. Retail
  16. Technology, Information and Media
  17. Transportation, Logistics and Storage
  18. Utilities
  19. Wholesale
  20. Professional Services

Usage

from transformers import DistilBertTokenizer, TFDistilBertForSequenceClassification, pipeline

# Load the fine-tuned tokenizer and TensorFlow model from the Hugging Face Hub
tokenizer = DistilBertTokenizer.from_pretrained("swarupt/industry-classification")
model = TFDistilBertForSequenceClassification.from_pretrained("swarupt/industry-classification")

# "text-classification" is the canonical task name; "sentiment-analysis" is an alias for it
classifier = pipeline("text-classification", model=model, tokenizer=tokenizer)
classifier("Consumers enjoy PepsiCo products more than one billion times a day in more than 200 countries and territories. In 2023, PepsiCo generated more than $91 billion in net revenue, driven by a complementary beverage and convenient foods portfolio that includes Lay’s, Doritos, Cheetos, Gatorade, Pepsi-Cola, Mountain Dew, Quaker and SodaStream.")

# Output:
# [{'label': 'Manufacturing', 'score': 0.9472715854644775}]
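
The pipeline returns a list of dictionaries, each with a `label` and a `score`. A minimal sketch of how that output might be post-processed is shown below. It assumes you want to fall back to a placeholder when the model is not confident. The helper `pick_industry`, the `"Unknown"` placeholder, and the 0.5 cutoff are hypothetical and not part of the model or the `transformers` library.

```python
from typing import Dict, List

# Hypothetical cutoff; tune it on your own validation data.
MIN_SCORE = 0.5


def pick_industry(predictions: List[Dict], min_score: float = MIN_SCORE) -> str:
    """Return the highest-scoring label, or "Unknown" if its score is below min_score."""
    if not predictions:
        return "Unknown"
    best = max(predictions, key=lambda p: p["score"])
    return best["label"] if best["score"] >= min_score else "Unknown"


# Pipeline-style output copied from the usage example.
example = [{"label": "Manufacturing", "score": 0.9472715854644775}]

print(pick_industry(example))                  # Manufacturing
print(pick_industry(example, min_score=0.99))  # Unknown
```

Because the helper only reads the `label` and `score` keys, it can be tried on saved predictions without loading the model.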

Supported Languages

  • English
  • German / Deutsch