
Classifying Text into NACE Codes

This model is an XLM-RoBERTa (xlm-roberta-base) model fine-tuned to classify descriptions of business activities into NACE Rev. 2 codes. It was fine-tuned from erst/xlm-roberta-base-finetuned-nace.
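For readers unfamiliar with the target labels: NACE Rev. 2 is hierarchical, with two-digit divisions, three-digit groups, and four-digit classes, conventionally written like `41.20`. The sketch below splits a class code into those levels. The helper `split_nace_class` and the `NaceCode` type are hypothetical names introduced here for illustration, and it is an assumption that this model's labels use the dotted four-digit form; check the model's `id2label` mapping before relying on it.

```python
import re
from typing import NamedTuple


class NaceCode(NamedTuple):
    """Hypothetical container for the numeric levels of a NACE Rev. 2 class code."""
    division: str    # two digits, e.g. "41"
    group: str       # three digits, e.g. "41.2"
    nace_class: str  # four digits, e.g. "41.20"


# Assumes the conventional dotted form "DD.GC" for a four-digit class code
_CLASS_PATTERN = re.compile(r"^(\d{2})\.(\d)(\d)$")


def split_nace_class(code: str) -> NaceCode:
    """Split a class code such as "41.20" into division, group, and class."""
    match = _CLASS_PATTERN.match(code.strip())
    if match is None:
        raise ValueError(f"Not a four-digit NACE class code: {code!r}")
    division, group_digit, class_digit = match.groups()
    group = f"{division}.{group_digit}"
    return NaceCode(division, group, f"{group}{class_digit}")


print(split_nace_class("41.20"))
```

Grouping predictions by division or group in this way can be useful when class-level predictions are too fine-grained for a given task.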

Data

The fine-tuning data consists of activity descriptions that Icelandic businesses provided in their annual reports.

Quick Start

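from transformers import AutoModelForSequenceClassification, AutoTokenizer, pipeline

# Load the fine-tuned tokenizer and classifier from the Hugging Face Hub
tokenizer = AutoTokenizer.from_pretrained("skatturinn/isat-2008")
model = AutoModelForSequenceClassification.from_pretrained("skatturinn/isat-2008")

# By default the pipeline returns only the top-scoring label for each input,
# so the deprecated return_all_scores argument is not needed
pl = pipeline(
    "text-classification",
    model=model,
    tokenizer=tokenizer,
)

# Input text in English: "I am a carpenter and I intend to build a house"
print(pl("Ég er smiður og ætla mér að smíða hús"))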
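For a single input string, a text-classification pipeline returns a list of dictionaries with `label` and `score` keys. As a minimal sketch of one way to consume that output, the helper below keeps a prediction only when its score clears a threshold, so low-confidence cases can be routed to manual review. The function name `pick_confident_label`, the default threshold of 0.5, and the placeholder values in `example_output` are all assumptions made for illustration; they are not real output from this model.

```python
from typing import Any, Dict, List, Optional


def pick_confident_label(
    predictions: List[Dict[str, Any]], min_score: float = 0.5
) -> Optional[str]:
    """Return the highest-scoring label, or None if it scores below min_score."""
    if not predictions:
        return None
    best = max(predictions, key=lambda p: p["score"])
    return best["label"] if best["score"] >= min_score else None


# Placeholder data in the pipeline's output shape; NOT real model output
example_output = [{"label": "PLACEHOLDER_CODE", "score": 0.91}]

print(pick_confident_label(example_output))
print(pick_confident_label(example_output, min_score=0.95))
```

In practice the list passed in would be the return value of the `pl(...)` call, and the threshold would be tuned on labelled validation data.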

Model size: 279M params, tensor type F32 (Safetensors)


Base model: erst/xlm-roberta-base-finetuned-nace (skatturinn/isat-2008 is fine-tuned from it)