---
license: apache-2.0
language:
- zh
- en
model-index:
- name: skNMT-zh-en-1.2
  results:
  - task:
      type: translation
    metrics:
    - name: BLEU
      type: BLEU
      value: 20.4218
    - name: chrF
      type: chrf
      value: 50.2827
    dataset:
      name: WMT 2019
      type: wmt/wmt19
---

# skNMT-zh-en-1.2

A neural machine translation (NMT) model trained by sparkastML for translating from Chinese to English.

This model uses [OpenNMT](https://opennmt.net/) as its underlying framework.

## Usage

The model has already been exported to a CTranslate2-compatible format. Download the necessary files (`model.bin`, `config.json`, and `shared_vocabulary.json`) and load them with [CTranslate2](https://github.com/OpenNMT/CTranslate2).

We also provide the training checkpoint and the SentencePiece model, so you can run inference manually with OpenNMT.
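
As a starting point, the CTranslate2 route might look like the minimal sketch below. It assumes the `ctranslate2` and `sentencepiece` Python packages are installed, that the three exported files sit together in one directory, and that the provided SentencePiece model is used for both tokenizing the Chinese input and detokenizing the English output. The directory and file names in the commented example are placeholders, not names confirmed by this repository.

```python
def load_translator(model_dir, sp_model_path, device="cpu"):
    """Load the exported CTranslate2 model and the SentencePiece tokenizer."""
    # Imported here so translate() below can be used without these packages
    # being imported at module load time.
    import ctranslate2
    import sentencepiece as spm

    translator = ctranslate2.Translator(model_dir, device=device)
    sp = spm.SentencePieceProcessor(model_file=sp_model_path)
    return translator, sp


def translate(text, translator, sp):
    """Translate a single sentence and return the best hypothesis as text."""
    tokens = sp.encode(text, out_type=str)           # text -> subword pieces
    results = translator.translate_batch([tokens])   # batch containing one sentence
    return sp.decode(results[0].hypotheses[0])       # subword pieces -> text


# Example call (both paths are hypothetical placeholders):
# translator, sp = load_translator("skNMT-zh-en-1.2", "sentencepiece.model")
# print(translate("你好，世界！", translator, sp))
```

Keeping loading separate from translation means the model is loaded once and reused across calls. If the model expects extra preprocessing beyond SentencePiece, that step would need to be added before `translate_batch`.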

## Model Details

- **Source Language:** Chinese (Simplified)
- **Target Language:** English
- **Training Time:** 11.3 hours in total, 46,500 steps (~1×10¹⁸ FLOPs)
- **Training Device:**
  - RTX 3080 (20GB): steps 0-20,000
  - RTX 4070: steps 20,000-46,500
- **Corpus Size:** Over 10 million sentences
- **Validation BLEU Score:** 21.28
- **Validation Loss (Cross Entropy):** 3.152
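
To put the training figures in perspective, the short sketch below derives average rates from the numbers quoted in the list above. These are plain arithmetic averages over the whole run, which spans two different GPUs, and the FLOPs total is itself approximate, so the results are rough estimates rather than measured throughput. The function and variable names are illustrative.

```python
# Figures quoted in the model details above.
TRAIN_HOURS = 11.3
TRAIN_STEPS = 46_500
TOTAL_FLOPS = 1e18  # approximate total, as stated


def training_averages(hours, steps, total_flops):
    """Return whole-run averages derived from the reported totals."""
    seconds = hours * 3600
    return {
        "steps_per_second": steps / seconds,
        "flops_per_step": total_flops / steps,
        "flops_per_second": total_flops / seconds,
    }


averages = training_averages(TRAIN_HOURS, TRAIN_STEPS, TOTAL_FLOPS)
for name, value in averages.items():
    print(f"{name}: {value:.3g}")
```

This works out to a little over one step per second and on the order of 2×10¹³ FLOPs per step, averaged over the full run.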