opus-mt-pl-uk / README.md
system
Update README.md 5639589
1 ---
2 language:
3 - pl
4 - uk
5
6 tags:
7 - translation
8
9 license: apache-2.0
10 ---
11
12 ### pol-ukr
13
14 * source group: Polish
15 * target group: Ukrainian
16 * OPUS readme: [pol-ukr](https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/pol-ukr/README.md)
17
18 * model: transformer-align
19 * source language(s): pol
20 * target language(s): ukr
21 * model: transformer-align
22 * pre-processing: normalization + SentencePiece (spm32k,spm32k)
23 * download original weights: [opus-2020-06-17.zip](https://object.pouta.csc.fi/Tatoeba-MT-models/pol-ukr/opus-2020-06-17.zip)
24 * test set translations: [opus-2020-06-17.test.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/pol-ukr/opus-2020-06-17.test.txt)
25 * test set scores: [opus-2020-06-17.eval.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/pol-ukr/opus-2020-06-17.eval.txt)
26
27 ## Benchmarks
28
29 | testset | BLEU | chr-F |
30 |-----------------------|-------|-------|
31 | Tatoeba-test.pol.ukr | 47.1 | 0.665 |
32
33
34 ### System Info:
35 - hf_name: pol-ukr
36
37 - source_languages: pol
38
39 - target_languages: ukr
40
41 - opus_readme_url: https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/pol-ukr/README.md
42
43 - original_repo: Tatoeba-Challenge
44
45 - tags: ['translation']
46
47 - languages: ['pl', 'uk']
48
49 - src_constituents: {'pol'}
50
51 - tgt_constituents: {'ukr'}
52
53 - src_multilingual: False
54
55 - tgt_multilingual: False
56
57 - prepro: normalization + SentencePiece (spm32k,spm32k)
58
59 - url_model: https://object.pouta.csc.fi/Tatoeba-MT-models/pol-ukr/opus-2020-06-17.zip
60
61 - url_test_set: https://object.pouta.csc.fi/Tatoeba-MT-models/pol-ukr/opus-2020-06-17.test.txt
62
63 - src_alpha3: pol
64
65 - tgt_alpha3: ukr
66
67 - short_pair: pl-uk
68
69 - chrF2_score: 0.665
70
71 - bleu: 47.1
72
73 - brevity_penalty: 0.992
74
75 - ref_len: 13434.0
76
77 - src_name: Polish
78
79 - tgt_name: Ukrainian
80
81 - train_date: 2020-06-17
82
83 - src_alpha2: pl
84
85 - tgt_alpha2: uk
86
87 - prefer_old: False
88
89 - long_pair: pol-ukr
90
91 - helsinki_git_sha: 480fcbe0ee1bf4774bcbe6226ad9f58e63f6c535
92
93 - transformers_git_sha: 2207e5d8cb224e954a7cba69fa4ac2309e9ff30b
94
95 - port_machine: brutasse
96
97 - port_time: 2020-08-21-14:41