opus-mt-en-cpp / README.md
system's picture
system HF staff
Update README.md
ec258c0
|
raw
history blame
2.23 kB
metadata
language: en
tags:
  - translation
license: apache-2.0

eng-cpp

  • source group: English

  • target group: Creoles and pidgins, Portuguese-based

  • OPUS readme: eng-cpp

  • model: transformer

  • source language(s): eng

  • target language(s): ind max_Latn min pap tmw_Latn zlm_Latn zsm_Latn

  • model: transformer

  • pre-processing: normalization + SentencePiece (spm32k,spm32k)

  • a sentence initial language token is required in the form of >>id<< (id = valid target language ID)

  • download original weights: opus2m-2020-08-01.zip

  • test set translations: opus2m-2020-08-01.test.txt

  • test set scores: opus2m-2020-08-01.eval.txt

Benchmarks

testset BLEU chr-F
Tatoeba-test.eng-msa.eng.msa 32.6 0.573
Tatoeba-test.eng.multi 32.7 0.574
Tatoeba-test.eng-pap.eng.pap 42.5 0.633

System Info: