Multilingual parliamentary model XLM-R-parla

This is the XLM-R-large model, additionally pre-trained on texts of parliamentary proceedings. The texts used for the additional pre-training, 1.7 billion words in total, come from the ParlaMint and EuroParl corpora.

The model is a result of the ParlaMint project. Details of the model development are described in the following paper:

The first application of this model is the XLM-R-parlasent model, fine-tuned on the ParlaSent dataset for the task of sentiment analysis in parliamentary proceedings.
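
A minimal sketch of how the model might be queried for masked-token prediction with the Hugging Face `transformers` library is given below. The Hub identifier in `MODEL_ID` is an assumption, since this card does not state it, and the sketch also assumes the published checkpoint includes the masked-language-modelling head. The helper names `mask_word` and `predict_masked` are hypothetical and serve only as illustration.

```python
# Assumed Hub identifier; this card does not state it, so replace as needed.
MODEL_ID = "classla/xlm-r-parla"

# XLM-R tokenizers use "<mask>" as the mask token.
MASK_TOKEN = "<mask>"


def mask_word(sentence: str, word: str, mask_token: str = MASK_TOKEN) -> str:
    """Replace the first occurrence of `word` in `sentence` with the mask token."""
    if word not in sentence:
        raise ValueError(f"{word!r} does not occur in the sentence")
    return sentence.replace(word, mask_token, 1)


def predict_masked(sentence: str, model_id: str = MODEL_ID, top_k: int = 5):
    """Return (token, score) pairs for the single masked position in `sentence`.

    Downloads the model on first use, so this needs network access and
    enough memory for an XLM-R-large sized checkpoint.
    """
    # Imported lazily so that mask_word works without transformers installed.
    from transformers import pipeline

    fill_mask = pipeline("fill-mask", model=model_id)
    predictions = fill_mask(sentence, top_k=top_k)
    return [(p["token_str"], p["score"]) for p in predictions]
```

For example, `predict_masked(mask_word("I thank the honourable member for the question.", "member"))` would return the five most probable fillers for the masked word, which gives a quick impression of how the parliamentary pre-training shapes the model's predictions.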

