Edit model card

eng-nor

  • source language name: English

  • target language name: Norwegian

  • OPUS readme: README.md

  • model: transformer-align

  • source language code: en

  • target language codes: nb, nn

  • dataset: opus with backtranslations

  • release date: 2021-04-20

  • pre-processing: normalization + SentencePiece (spm32k,spm32k)

  • download original weights: opus+bt-2021-04-20.zip

  • a sentence-initial language token is required in the form of >>id<<(id = valid, usually three-letter target language ID)

  • Training data:

    • eng-nno: Tatoeba-train (1661769) wikipedia.aa.nno-eng (995603) wikipedia.ab.nno-eng (605107) wikiquote.aa.nno-eng (22626)
    • eng-nob: Tatoeba-train (11525999) wikibooks.aa.nob-eng (37901) wikinews.aa.nob-eng (8706) wikipedia.aa.nob-eng (992563) wikipedia.ab.nob-eng (992772) wikipedia.ac.nob-eng (992621) wikipedia.ad.nob-eng (992828) wikipedia.ae.nob-eng (992812) wikipedia.af.nob-eng (976715) wikiquote.aa.nob-eng (10443) wikisource.aa.nob-eng (279891)
  • Validation data:

    • eng-nno: Tatoeba-dev, 505
    • eng-nob: Tatoeba-dev, 5189
    • total-size-shuffled: 1505
    • devset-selected: top 1505 lines of Tatoeba-dev.src.shuffled
  • Test data:

    • Tatoeba-test.eng-nno: 460/3428
    • Tatoeba-test.eng-nob: 4539/36110
    • Tatoeba-test.eng-nor: 4999/39547
  • test set translations file: test.txt

  • test set scores file: eval.txt

  • BLEU-scores

    Test set score
    Tatoeba-test.eng-nob 56.4
    Tatoeba-test.eng-nor 55.4
    Tatoeba-test.eng-nno 40.3
  • chr-F-scores

    Test set score
    Tatoeba-test.eng-nob 0.716
    Tatoeba-test.eng-nor 0.71
    Tatoeba-test.eng-nno 0.615

System Info:

  • hf_name: eng-nor
  • source_languages: en
  • target_languages: nb,nn
  • opus_readme_url: https://object.pouta.csc.fi/Tatoeba-MT-models/eng-nor/opus+bt-2021-04-20.zip/README.md
  • original_repo: Tatoeba-Challenge
  • tags: ['translation']
  • languages: ['en', 'nb', 'nn']
  • src_constituents: ['eng']
  • tgt_constituents: ['nob', 'nno']
  • src_multilingual: False
  • tgt_multilingual: True
  • helsinki_git_sha: 59400fea592520766f9910390155681bc930dbc4
  • transformers_git_sha: fd5cdaeea6eafac32e9d967327bfa3dc0e0d962d
  • port_machine: DESKTOP-6CPR2HH
  • port_time: 2023-01-23-21:07
Downloads last month
10
Inference API
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.