Tanor commited on
Commit
ecd25d5
1 Parent(s): 59f30af

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +41 -0
README.md CHANGED
@@ -36,3 +36,44 @@ model-index:
36
  type: accuracy
37
  value: 0.9859287587
38
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
36
  type: accuracy
37
  value: 0.9859287587
38
  ---
39
+
40
+ sr_pln_tesla_bcx is a spaCy model meticulously fine-tuned for Part-of-Speech Tagging, Lemmatization, and Named Entity Recognition in Serbian language texts. This advanced model incorporates a transformer layer based on XLM-R-BERTić, enhancing its analytical capabilities. It is proficient in identifying 7 distinct categories of entities: PERS (persons), ROLE (professions), DEMO (demonyms), ORG (organizations), LOC (locations), WORK (artworks), and EVENT (events). Detailed information about these categories is available in the accompanying table. The development of this model has been made possible through the support of the Science Fund of the Republic of Serbia, under grant #7276, for the project 'Text Embeddings - Serbian Language Applications - TESLA'.
41
+
42
+ | Feature | Description |
43
+ | --- | --- |
44
+ | **Name** | `sr_sr_pln_tesla_bcx` |
45
+ | **Version** | `1.0.0` |
46
+ | **spaCy** | `>=3.7.2,<3.8.0` |
47
+ | **Default Pipeline** | `transformer`, `tagger`, `trainable_lemmatizer`, `ner` |
48
+ | **Components** | `transformer`, `tagger`, `trainable_lemmatizer`, `ner` |
49
+ | **Vectors** | 0 keys, 0 unique vectors (0 dimensions) |
50
+ | **Sources** | n/a |
51
+ | **License** | `CC BY-SA 3.0` |
52
+ | **Author** | [Milica Ikonić Nešić, Saša Petalinkar, Mihailo Škorić, Ranka Stanković](https://tesla.rgf.bg.ac.rs/) |
53
+
54
+ ### Label Scheme
55
+
56
+ <details>
57
+
58
+ <summary>View label scheme (23 labels for 2 components)</summary>
59
+
60
+ | Component | Labels |
61
+ | --- | --- |
62
+ | **`tagger`** | `ADJ`, `ADP`, `ADV`, `AUX`, `CCONJ`, `DET`, `INTJ`, `NOUN`, `NUM`, `PART`, `PRON`, `PROPN`, `PUNCT`, `SCONJ`, `VERB`, `X` |
63
+ | **`ner`** | `DEMO`, `EVENT`, `LOC`, `ORG`, `PERS`, `ROLE`, `WORK` |
64
+
65
+ </details>
66
+
67
+ ### Accuracy
68
+
69
+ | Type | Score |
70
+ | --- | --- |
71
+ | `TAG_ACC` | 98.46 |
72
+ | `LEMMA_ACC` | 98.59 |
73
+ | `ENTS_F` | 95.69 |
74
+ | `ENTS_P` | 95.39 |
75
+ | `ENTS_R` | 96.00 |
76
+ | `TRANSFORMER_LOSS` | 258521.29 |
77
+ | `TAGGER_LOSS` | 257551.38 |
78
+ | `TRAINABLE_LEMMATIZER_LOSS` | 247675.41 |
79
+ | `NER_LOSS` | 151058.86 |