fragata's picture
Update README.md
81a847d
metadata
license: mit
language:
  - en
  - hu
tags:
  - text-classification
metrics:
  - mae
  - rmse
  - correlation
widget:
  - text: How much luggage? <sep> Mennyi csomagot Vihetek?

English-Hungarian Qualtiy Estimation with finetuned XLM-RoBERTa model

For further models, scripts and details, see our demo site.

  • Pretrained model used: XLM-RoBERTa base
  • Finetuned on HuQ corpus
  • Labels: regression model
  • Separator: <sep>

Limitations

  • max_seq_length = 256
  • input format: {src_en_sentence} <sep> {tgt_hu_sentence}

Results

10-fold cross validation on HuQ corpus

Model Correlation MAE RMSE
baseline 0.6100 0.7459 0.9775
XLM-R 0.7948 0.6451 0.8898

Citation

If you use this model, please cite the following paper:

@article{yang-rl,
    title = {Enhancing Machine Translation with Quality Estimation and Reinforcement Learning},
    journal = {Annales Mathematicae et Informaticae},
    year = {2023},
    author = {Yang, Zijian Győző and Laki, László János},
    pages = {Accepted}
}