Helsinki-NLP/opus-mt-mkh-en

Contributed by the Language Technology Research Group at the University of Helsinki

How to use this model directly from the 🤗 Transformers library:

from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("Helsinki-NLP/opus-mt-mkh-en")
model = AutoModelForSeq2SeqLM.from_pretrained("Helsinki-NLP/opus-mt-mkh-en")
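
Once the tokenizer and model are loaded, translation is a tokenize, generate, decode round trip. The following is a minimal sketch, not part of the original card: the helper names `load_model` and `translate`, the `max_new_tokens=128` limit, and the sample sentence in the comment are illustrative assumptions. Because English is the only target language listed, the sketch assumes no target-language prefix token is needed on the input.

```python
from typing import List

MODEL_NAME = "Helsinki-NLP/opus-mt-mkh-en"


def load_model(name: str = MODEL_NAME):
    """Load tokenizer and model (hypothetical helper; downloads weights on first use)."""
    # Imported lazily so that defining the helpers does not require the library.
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForSeq2SeqLM.from_pretrained(name)
    return tokenizer, model


def translate(texts: List[str], tokenizer, model, max_new_tokens: int = 128) -> List[str]:
    """Translate a batch of source sentences to English (illustrative sketch)."""
    # Pad to a common length so the batch forms one tensor; truncate long inputs.
    batch = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
    # max_new_tokens is an arbitrary cap chosen for this example.
    generated = model.generate(**batch, max_new_tokens=max_new_tokens)
    # Drop padding and end-of-sequence tokens from the decoded strings.
    return tokenizer.batch_decode(generated, skip_special_tokens=True)


# Example usage (requires network access and the model weights):
#   tokenizer, model = load_model()
#   print(translate(["Xin chào, bạn khỏe không?"], tokenizer, model))
```

Keeping loading separate from translating lets the same tokenizer and model be reused across many batches.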


  • source group: Mon-Khmer languages

  • target group: English

  • OPUS readme: mkh-eng

  • model: transformer

  • source language(s): kha khm khm_Latn mnw vie vie_Hani

  • target language(s): eng

  • pre-processing: normalization + SentencePiece (spm32k,spm32k)

  • download original weights:

  • test set translations: opus-2020-07-27.test.txt

  • test set scores: opus-2020-07-27.eval.txt


testset                         BLEU   chr-F
Tatoeba-test.kha-eng.kha.eng     0.5   0.108
Tatoeba-test.khm-eng.khm.eng     8.5   0.206
Tatoeba-test.mnw-eng.mnw.eng     0.7   0.110
Tatoeba-test.multi.eng          24.5   0.407
Tatoeba-test.vie-eng.vie.eng    34.4   0.529
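
To give a feel for what the chr-F column measures, here is a simplified sentence-level sketch of a character n-gram F-score. It is not the scoring script behind the numbers above. It assumes character n-grams of order 1 to 6, whitespace removed before counting, precision and recall averaged over the n-gram orders, and beta = 2 (recall weighted more than precision). The function names `char_ngrams` and `chrf` are hypothetical.

```python
from collections import Counter


def char_ngrams(text: str, n: int) -> Counter:
    """Count character n-grams after removing whitespace (an assumption of this sketch)."""
    chars = "".join(text.split())
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def chrf(hypothesis: str, reference: str, max_n: int = 6, beta: float = 2.0) -> float:
    """Simplified character n-gram F-score on a 0..1 scale."""
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # skip orders longer than either string
        # Clipped overlap: each n-gram counts at most as often as in both strings.
        overlap = sum((hyp & ref).values())
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    return (1 + beta ** 2) * p * r / (beta ** 2 * p + r)
```

An exact match scores 1.0 and strings with no characters in common score 0.0, so a value such as 0.529 indicates substantial but partial character-level overlap with the reference translations.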

System Info: