bert-base-multilingual-cased-conloan-lv

Description

This model is part of a Bachelor's thesis at the University of Latvia: "Contextual approach to Latvian loanword detection: dataset creation and classification experiments".

It is a fine-tuned version of bert-base-multilingual-cased on the baseline dataset.

Classes and Labels

Dataset Type: {Baseline (Binary) / Extended (Contrastive)} Labels:

  • O: Outside
  • LOAN: Borrowing (Materiālie aizguvumi)

Performance (k-fold average)

  • F1 Score: {0.899}
  • Std Dev (σ): {0.009}

Usage

from transformers import pipeline
nlp = pipeline("ner", model="jorenchik/bert-base-multilingual-cased-conloan-lv")
nlp("Šodienas mītings bija ļoti produktīvs.")
Downloads last month
53
Safetensors
Model size
0.2B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train jorenchik/bert-base-multilingual-cased-conloan-lv

Collection including jorenchik/bert-base-multilingual-cased-conloan-lv