bhavitvyamalik's picture
Upload README.md with huggingface_hub
c4b8792 verified
|
raw
history blame
1.17 kB
metadata
tags:
  - translation
license: cc-by-4.0

Translation model for en-zh_hant HPLT v1.0

This repository contains the model weights for translation models trained with Marian for HPLT project. For usage instructions, evaluation scripts, and inference scripts, please refer to the HPLT-MT-Models v1.0 GitHub repository.

  • Source language: en
  • Target language: zh_hant
  • Dataset: HPLT only
  • Model: transformer-base
  • Tokenizer: SentencePiece (Unigram)
  • Cleaning: We use OpusCleaner for cleaning the corpus. Details about rules used can be found in the filter files in Github

To run inference with Marian, refer to the Inference/Decoding/Translation section of our GitHub repository.

Benchmarks

testset BLEU chr-F comet
flores200 25.4 18.9 0.8454
ntrex 21.3 21.6 0.8042