matejulcar commited on
Commit
29ad780
1 Parent(s): 4610cfb

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +22 -0
README.md ADDED
@@ -0,0 +1,22 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - lt
4
+ - lv
5
+ - en
6
+ - multilingual
7
+
8
+ license: cc-by-sa-4.0
9
+ ---
10
+
11
+ # LitLat BERT
12
+ LitLat BERT is a trilingual model, using xlm-roberta-base architecture, trained on Lithuanian, Latvian, and English corpora. Focusing on three languages, the model performs better than [multilingual BERT](https://huggingface.co/bert-base-multilingual-cased), while still offering an option for cross-lingual knowledge transfer, which a monolingual model wouldn't.
13
+
14
+ ### Named entity recognition evaluation
15
+
16
+ We compare LitLat BERT with multilingual BERT (mBERT), XLM-RoBERTa (XLM-R) and monolingual Latvian BERT (LVBERT) (Znotins and Barzdins, 2020). The report the results as a macro F1 score of 3 named entity classes shared in all three datasets: person, location, organization.
17
+
18
+ Language | mBERT | XLM-R | LVBERT | LitLat
19
+ ---|---|---|---|---
20
+ Latvian | 0.830 | 0.865 | 0.797 | **0.881**
21
+ Lithuanian | 0.797 | 0.817 | / | **0.850**
22
+ English | 0.939 | 0.937 | / | **0.943**