Update README.md
Browse files
README.md
CHANGED
@@ -29,6 +29,18 @@ This is a baseline model for named entity **lemmatization** trained on the singl
|
|
29 |
[SlavicNER corpus](https://github.com/SlavicNLP/SlavicNER).
|
30 |
|
31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
# Usage
|
33 |
|
34 |
You can use this model directly with a pipeline for text2text generation:
|
@@ -47,4 +59,8 @@ outputs = pipe(texts)
|
|
47 |
lemmas = [o['generated_text'] for o in outputs]
|
48 |
print(lemmas)
|
49 |
# ['Polska', 'Velká Británie', 'българи', 'Великобритания', 'evropska komisija', 'Європейське агентство лікарських засобів']
|
50 |
-
```
|
|
|
|
|
|
|
|
|
|
29 |
[SlavicNER corpus](https://github.com/SlavicNLP/SlavicNER).
|
30 |
|
31 |
|
32 |
+
# Resources and Technical Documentation
|
33 |
+
|
34 |
+
- Paper: [Cross-lingual Named Entity Corpus for Slavic Languages](https://arxiv.org/pdf/2404.00482), to appear in LREC-COLING 2024.
|
35 |
+
- Annotation guidelines: https://arxiv.org/pdf/2404.00482
|
36 |
+
- SlavicNER Corpus: https://github.com/SlavicNLP/SlavicNER
|
37 |
+
|
38 |
+
|
39 |
+
# Evaluation
|
40 |
+
|
41 |
+
*Will appear soon*
|
42 |
+
|
43 |
+
|
44 |
# Usage
|
45 |
|
46 |
You can use this model directly with a pipeline for text2text generation:
|
|
|
59 |
lemmas = [o['generated_text'] for o in outputs]
|
60 |
print(lemmas)
|
61 |
# ['Polska', 'Velká Británie', 'българи', 'Великобритания', 'evropska komisija', 'Європейське агентство лікарських засобів']
|
62 |
+
```
|
63 |
+
|
64 |
+
# Citation
|
65 |
+
|
66 |
+
*Will appear soon*
|