Token Classification
Transformers
PyTorch
4 languages
xlm-roberta
Inference Endpoints
mbruton commited on
Commit
53e5c35
1 Parent(s): c1ea3b6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +17 -6
README.md CHANGED
@@ -156,11 +156,22 @@ Per label type:
156
  **BibTeX:**
157
 
158
  ```
159
- @mastersthesis{bruton-galician-srl-23,
160
- author = {Bruton, Micaella},
161
- title = {BERTie Bott's Every Flavor Labels: A Tasty Guide to Developing a Semantic Role Labeling Model for Galician},
162
- school = {Uppsala University},
163
- year = {2023},
164
- type = {Master's thesis},
 
 
 
 
 
 
 
 
 
 
165
  }
 
166
  ```
 
156
  **BibTeX:**
157
 
158
  ```
159
+ @inproceedings{bruton-beloucif-2023-bertie,
160
+ title = "{BERT}ie Bott{'}s Every Flavor Labels: A Tasty Introduction to Semantic Role Labeling for {G}alician",
161
+ author = "Bruton, Micaella and
162
+ Beloucif, Meriem",
163
+ editor = "Bouamor, Houda and
164
+ Pino, Juan and
165
+ Bali, Kalika",
166
+ booktitle = "Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing",
167
+ month = dec,
168
+ year = "2023",
169
+ address = "Singapore",
170
+ publisher = "Association for Computational Linguistics",
171
+ url = "https://aclanthology.org/2023.emnlp-main.671",
172
+ doi = "10.18653/v1/2023.emnlp-main.671",
173
+ pages = "10892--10902",
174
+ abstract = "In this paper, we leverage existing corpora, WordNet, and dependency parsing to build the first Galician dataset for training semantic role labeling systems in an effort to expand available NLP resources. Additionally, we introduce verb indexing, a new pre-processing method, which helps increase the performance when semantically parsing highly-complex sentences. We use transfer-learning to test both the resource and the verb indexing method. Our results show that the effects of verb indexing were amplified in scenarios where the model was both pre-trained and fine-tuned on datasets utilizing the method, but improvements are also noticeable when only used during fine-tuning. The best-performing Galician SRL model achieved an f1 score of 0.74, introducing a baseline for future Galician SRL systems. We also tested our method on Spanish where we achieved an f1 score of 0.83, outperforming the baseline set by the 2009 CoNLL Shared Task by 0.025 showing the merits of our verb indexing method for pre-processing.",
175
  }
176
+
177
  ```