oliat committed on
Commit
e995be1
1 Parent(s): 8ac591c

Updated Readme

Files changed (1)
  1. README.md +65 -3
README.md CHANGED
@@ -1,3 +1,65 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ language:
+ - ru
+ ---
+
+
+ # Model Card for xlm-roberta-large-lemma-ru
+
+ This model is a fine-tuned version of [xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) for the contextual lemmatization task.
+ The datasets used for training are extracted from the data of the [SIGMORPHON 2019 Shared Task](https://aclanthology.org/W19-4211/).
+ The model for the English language was trained using [EWT corpus]().
+
+
+ # Training Hyperparameters
+
+ SEED: 42
+ EPOCHS: 15
+ BATCH SIZE: 8
+ GRADIENT ACCUMULATION STEPS: 2
+ LEARNING RATE: 0.00002
+ WARMUP: 0.06
+ WEIGHT DECAY: 0.1
+
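The hyperparameters listed in the card imply an effective batch size and a warmup length that are not spelled out. The following is a minimal sketch of that arithmetic, not the authors' training code. It assumes that `WARMUP: 0.06` is a fraction of the total optimizer steps and that gradient accumulation multiplies the batch size of 8. The `Hyperparameters` class, the `schedule_summary` helper, and the figure of 1,600 training sentences are hypothetical names and values chosen for illustration only.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Hyperparameters:
    """Values copied from the Training Hyperparameters section of the card."""
    seed: int = 42
    epochs: int = 15
    batch_size: int = 8
    gradient_accumulation_steps: int = 2
    learning_rate: float = 2e-5  # written as 0.00002 in the card
    warmup: float = 0.06         # ASSUMPTION: fraction of total optimizer steps
    weight_decay: float = 0.1


def schedule_summary(hp: Hyperparameters, num_train_examples: int) -> dict:
    """Derive step counts from the hyperparameters and a dataset size.

    num_train_examples is a hypothetical input; the card does not state it.
    """
    # One optimizer update consumes batch_size * accumulation examples.
    effective_batch_size = hp.batch_size * hp.gradient_accumulation_steps
    # Forward/backward passes per epoch, then grouped into optimizer updates.
    batches_per_epoch = math.ceil(num_train_examples / hp.batch_size)
    steps_per_epoch = math.ceil(batches_per_epoch / hp.gradient_accumulation_steps)
    total_steps = steps_per_epoch * hp.epochs
    # Warmup length under the fraction-of-total-steps assumption.
    warmup_steps = math.ceil(total_steps * hp.warmup)
    return {
        "effective_batch_size": effective_batch_size,
        "optimizer_steps_per_epoch": steps_per_epoch,
        "total_optimizer_steps": total_steps,
        "warmup_steps": warmup_steps,
    }


if __name__ == "__main__":
    # 1600 is a placeholder dataset size, not the size of the real corpus.
    for name, value in schedule_summary(Hyperparameters(), 1600).items():
        print(f"{name}: {value}")
```

To apply the sketch to the actual training set, replace the placeholder with the number of training sentences in the corpus; the exact scheduler and step rounding used by the authors may differ.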
+ # Results
+
+
+
+
+
+ For more details, see the paper and the repository:
+ - 📖 Paper: [On the Role of Morphological Information for Contextual Lemmatization](https://direct.mit.edu/coli/article/50/1/157/118134/On-the-Role-of-Morphological-Information-for)
+ - 🌐 Repository: [Datasets and training files](https://github.com/hitz-zentroa/ses-lemma)
+
+
+ **Contact**: [Olia Toporkov](https://www.ixa.eus/node/13292) and [Rodrigo Agerri](https://ragerri.github.io/), HiTZ Center - Ixa, University of the Basque Country UPV/EHU
+ **Funding**:
+ **Model type**: xlm-roberta-large
+ **Language(s) (NLP)**: Russian
+ **License**: apache-2.0
+
+
+
+
+
+ # Citation
+
+ ```bibtex
+ @article{10.1162/coli_a_00497,
+ author = {Toporkov, Olia and Agerri, Rodrigo},
+ title = "{On the Role of Morphological Information for Contextual
+ Lemmatization}",
+ journal = {Computational Linguistics},
+ volume = {50},
+ number = {1},
+ pages = {157-191},
+ year = {2024},
+ month = {03},
+ issn = {0891-2017},
+ doi = {10.1162/coli_a_00497},
+ url = {https://doi.org/10.1162/coli\_a\_00497},
+ eprint = {https://direct.mit.edu/coli/article-pdf/50/1/157/2367156/coli\_a\_00497.pdf},
+ }
+
+ ```