erst/xlm-roberta-base-finetuned-nace

CasperEriksen committed on Mar 2, 2021

Commit e7fdb9a (2 parents: 432fe36, 7aa08f7)

Merge branch 'main' of https://huggingface.co/erst/xlm-roberta-base-finetuned-nace into main

Files changed (1)

README.md CHANGED

@@ -1,10 +1,10 @@
-# Classifying Text into DB07 Codes
 This model is [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) fine-tuned to classify descriptions of activities into [NACE Rev. 2](https://ec.europa.eu/eurostat/web/nace-rev2) codes.
 ## Data
-The data used to fine-tune the model consist of 2.5 million descriptions of activities from Norwegian and Danish businesses. To improve the model's multilingual performance, random samples were machine translated into the following languages:
 - English
 - German
 - Spanish
@@ -17,8 +17,8 @@ The data used to fine-tune the model consist of 2.5 million descriptions of acti
 ```python
 from transformers import pipeline, AutoTokenizer, AutoModelForSequenceClassification
-tokenizer = AutoTokenizer.from_pretrained("erst/xlm-roberta-base-finetuned-db07")
-model = AutoModelForSequenceClassification.from_pretrained("erst/xlm-roberta-base-finetuned-db07")
 pl = pipeline(
     "sentiment-analysis",

+# Classifying Text into NACE Codes
 This model is [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) fine-tuned to classify descriptions of activities into [NACE Rev. 2](https://ec.europa.eu/eurostat/web/nace-rev2) codes.
 ## Data
+The data used to fine-tune the model consist of 2.5 million descriptions of activities from Norwegian and Danish businesses. To improve the model's multilingual performance, random samples of the Norwegian and Danish descriptions were machine translated into the following languages:
 - English
 - German
 - Spanish
 ```python
 from transformers import pipeline, AutoTokenizer, AutoModelForSequenceClassification
+tokenizer = AutoTokenizer.from_pretrained("erst/xlm-roberta-base-finetuned-nace")
+model = AutoModelForSequenceClassification.from_pretrained("erst/xlm-roberta-base-finetuned-nace")
 pl = pipeline(
     "sentiment-analysis",