opus-mt-ine-en / README.md
system
Update README.md 2896bfc
1
---
2
language: 
3
- ca
4
- es
5
- os
6
- ro
7
- fy
8
- cy
9
- sc
10
- is
11
- yi
12
- lb
13
- an
14
- sq
15
- fr
16
- ht
17
- rm
18
- ps
19
- af
20
- uk
21
- sl
22
- lt
23
- bg
24
- be
25
- gd
26
- si
27
- en
28
- br
29
- mk
30
- or
31
- mr
32
- ru
33
- fo
34
- co
35
- oc
36
- pl
37
- gl
38
- nb
39
- bn
40
- id
41
- hy
42
- da
43
- gv
44
- nl
45
- pt
46
- hi
47
- as
48
- kw
49
- ga
50
- sv
51
- gu
52
- wa
53
- lv
54
- el
55
- it
56
- hr
57
- ur
58
- nn
59
- de
60
- cs
61
- ine
62
63
tags:
64
- translation
65
66
license: apache-2.0
67
---
68
69
### ine-eng
70
71
* source group: Indo-European languages 
72
* target group: English 
73
*  OPUS readme: [ine-eng](https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/ine-eng/README.md)
74
75
*  model: transformer
76
* source language(s): afr aln ang_Latn arg asm ast awa bel bel_Latn ben bho bos_Latn bre bul bul_Latn cat ces cor cos csb_Latn cym dan deu dsb egl ell enm_Latn ext fao fra frm_Latn frr fry gcf_Latn gla gle glg glv gom gos got_Goth grc_Grek gsw guj hat hif_Latn hin hrv hsb hye ind isl ita jdt_Cyrl ksh kur_Arab kur_Latn lad lad_Latn lat_Latn lav lij lit lld_Latn lmo ltg ltz mai mar max_Latn mfe min mkd mwl nds nld nno nob nob_Hebr non_Latn npi oci ori orv_Cyrl oss pan_Guru pap pdc pes pes_Latn pes_Thaa pms pnb pol por prg_Latn pus roh rom ron rue rus san_Deva scn sco sgs sin slv snd_Arab spa sqi srp_Cyrl srp_Latn stq swe swg tgk_Cyrl tly_Latn tmw_Latn ukr urd vec wln yid zlm_Latn zsm_Latn zza
77
* target language(s): eng
78
* model: transformer
79
* pre-processing: normalization + SentencePiece (spm32k,spm32k)
80
* download original weights: [opus2m-2020-08-01.zip](https://object.pouta.csc.fi/Tatoeba-MT-models/ine-eng/opus2m-2020-08-01.zip)
81
* test set translations: [opus2m-2020-08-01.test.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/ine-eng/opus2m-2020-08-01.test.txt)
82
* test set scores: [opus2m-2020-08-01.eval.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/ine-eng/opus2m-2020-08-01.eval.txt)
83
84
## Benchmarks
85
86
| testset               | BLEU  | chr-F |
87
|-----------------------|-------|-------|
88
| newsdev2014-hineng.hin.eng 	| 11.2 	| 0.375 |
89
| newsdev2016-enro-roneng.ron.eng 	| 35.5 	| 0.614 |
90
| newsdev2017-enlv-laveng.lav.eng 	| 25.1 	| 0.542 |
91
| newsdev2019-engu-gujeng.guj.eng 	| 16.0 	| 0.420 |
92
| newsdev2019-enlt-liteng.lit.eng 	| 24.0 	| 0.522 |
93
| newsdiscussdev2015-enfr-fraeng.fra.eng 	| 30.1 	| 0.550 |
94
| newsdiscusstest2015-enfr-fraeng.fra.eng 	| 33.4 	| 0.572 |
95
| newssyscomb2009-ceseng.ces.eng 	| 24.0 	| 0.520 |
96
| newssyscomb2009-deueng.deu.eng 	| 25.7 	| 0.526 |
97
| newssyscomb2009-fraeng.fra.eng 	| 27.9 	| 0.550 |
98
| newssyscomb2009-itaeng.ita.eng 	| 31.4 	| 0.574 |
99
| newssyscomb2009-spaeng.spa.eng 	| 28.3 	| 0.555 |
100
| news-test2008-deueng.deu.eng 	| 24.0 	| 0.515 |
101
| news-test2008-fraeng.fra.eng 	| 24.5 	| 0.524 |
102
| news-test2008-spaeng.spa.eng 	| 25.5 	| 0.533 |
103
| newstest2009-ceseng.ces.eng 	| 23.3 	| 0.516 |
104
| newstest2009-deueng.deu.eng 	| 23.2 	| 0.512 |
105
| newstest2009-fraeng.fra.eng 	| 27.3 	| 0.545 |
106
| newstest2009-itaeng.ita.eng 	| 30.3 	| 0.567 |
107
| newstest2009-spaeng.spa.eng 	| 27.9 	| 0.549 |
108
| newstest2010-ceseng.ces.eng 	| 23.8 	| 0.523 |
109
| newstest2010-deueng.deu.eng 	| 26.2 	| 0.545 |
110
| newstest2010-fraeng.fra.eng 	| 28.6 	| 0.562 |
111
| newstest2010-spaeng.spa.eng 	| 31.4 	| 0.581 |
112
| newstest2011-ceseng.ces.eng 	| 24.2 	| 0.521 |
113
| newstest2011-deueng.deu.eng 	| 23.9 	| 0.522 |
114
| newstest2011-fraeng.fra.eng 	| 29.5 	| 0.570 |
115
| newstest2011-spaeng.spa.eng 	| 30.3 	| 0.570 |
116
| newstest2012-ceseng.ces.eng 	| 23.5 	| 0.516 |
117
| newstest2012-deueng.deu.eng 	| 24.9 	| 0.529 |
118
| newstest2012-fraeng.fra.eng 	| 30.0 	| 0.568 |
119
| newstest2012-ruseng.rus.eng 	| 29.9 	| 0.565 |
120
| newstest2012-spaeng.spa.eng 	| 33.3 	| 0.593 |
121
| newstest2013-ceseng.ces.eng 	| 25.6 	| 0.531 |
122
| newstest2013-deueng.deu.eng 	| 27.7 	| 0.545 |
123
| newstest2013-fraeng.fra.eng 	| 30.0 	| 0.561 |
124
| newstest2013-ruseng.rus.eng 	| 24.4 	| 0.514 |
125
| newstest2013-spaeng.spa.eng 	| 30.8 	| 0.577 |
126
| newstest2014-csen-ceseng.ces.eng 	| 27.7 	| 0.558 |
127
| newstest2014-deen-deueng.deu.eng 	| 27.7 	| 0.545 |
128
| newstest2014-fren-fraeng.fra.eng 	| 32.2 	| 0.592 |
129
| newstest2014-hien-hineng.hin.eng 	| 16.7 	| 0.450 |
130
| newstest2014-ruen-ruseng.rus.eng 	| 27.2 	| 0.552 |
131
| newstest2015-encs-ceseng.ces.eng 	| 25.4 	| 0.518 |
132
| newstest2015-ende-deueng.deu.eng 	| 28.8 	| 0.552 |
133
| newstest2015-enru-ruseng.rus.eng 	| 25.6 	| 0.527 |
134
| newstest2016-encs-ceseng.ces.eng 	| 27.0 	| 0.540 |
135
| newstest2016-ende-deueng.deu.eng 	| 33.5 	| 0.592 |
136
| newstest2016-enro-roneng.ron.eng 	| 32.8 	| 0.591 |
137
| newstest2016-enru-ruseng.rus.eng 	| 24.8 	| 0.523 |
138
| newstest2017-encs-ceseng.ces.eng 	| 23.7 	| 0.510 |
139
| newstest2017-ende-deueng.deu.eng 	| 29.3 	| 0.556 |
140
| newstest2017-enlv-laveng.lav.eng 	| 18.9 	| 0.486 |
141
| newstest2017-enru-ruseng.rus.eng 	| 28.0 	| 0.546 |
142
| newstest2018-encs-ceseng.ces.eng 	| 24.9 	| 0.521 |
143
| newstest2018-ende-deueng.deu.eng 	| 36.0 	| 0.604 |
144
| newstest2018-enru-ruseng.rus.eng 	| 23.8 	| 0.517 |
145
| newstest2019-deen-deueng.deu.eng 	| 31.5 	| 0.570 |
146
| newstest2019-guen-gujeng.guj.eng 	| 12.1 	| 0.377 |
147
| newstest2019-lten-liteng.lit.eng 	| 26.6 	| 0.555 |
148
| newstest2019-ruen-ruseng.rus.eng 	| 27.5 	| 0.541 |
149
| Tatoeba-test.afr-eng.afr.eng 	| 59.0 	| 0.724 |
150
| Tatoeba-test.ang-eng.ang.eng 	| 9.9 	| 0.254 |
151
| Tatoeba-test.arg-eng.arg.eng 	| 41.6 	| 0.487 |
152
| Tatoeba-test.asm-eng.asm.eng 	| 22.8 	| 0.392 |
153
| Tatoeba-test.ast-eng.ast.eng 	| 36.1 	| 0.521 |
154
| Tatoeba-test.awa-eng.awa.eng 	| 11.6 	| 0.280 |
155
| Tatoeba-test.bel-eng.bel.eng 	| 42.2 	| 0.597 |
156
| Tatoeba-test.ben-eng.ben.eng 	| 45.8 	| 0.598 |
157
| Tatoeba-test.bho-eng.bho.eng 	| 34.4 	| 0.518 |
158
| Tatoeba-test.bre-eng.bre.eng 	| 24.4 	| 0.405 |
159
| Tatoeba-test.bul-eng.bul.eng 	| 50.8 	| 0.660 |
160
| Tatoeba-test.cat-eng.cat.eng 	| 51.2 	| 0.677 |
161
| Tatoeba-test.ces-eng.ces.eng 	| 47.6 	| 0.641 |
162
| Tatoeba-test.cor-eng.cor.eng 	| 5.4 	| 0.214 |
163
| Tatoeba-test.cos-eng.cos.eng 	| 61.0 	| 0.675 |
164
| Tatoeba-test.csb-eng.csb.eng 	| 22.5 	| 0.394 |
165
| Tatoeba-test.cym-eng.cym.eng 	| 34.7 	| 0.522 |
166
| Tatoeba-test.dan-eng.dan.eng 	| 56.2 	| 0.708 |
167
| Tatoeba-test.deu-eng.deu.eng 	| 44.9 	| 0.625 |
168
| Tatoeba-test.dsb-eng.dsb.eng 	| 21.0 	| 0.383 |
169
| Tatoeba-test.egl-eng.egl.eng 	| 6.9 	| 0.221 |
170
| Tatoeba-test.ell-eng.ell.eng 	| 62.1 	| 0.741 |
171
| Tatoeba-test.enm-eng.enm.eng 	| 22.6 	| 0.466 |
172
| Tatoeba-test.ext-eng.ext.eng 	| 33.2 	| 0.496 |
173
| Tatoeba-test.fao-eng.fao.eng 	| 28.1 	| 0.460 |
174
| Tatoeba-test.fas-eng.fas.eng 	| 9.6 	| 0.306 |
175
| Tatoeba-test.fra-eng.fra.eng 	| 50.3 	| 0.661 |
176
| Tatoeba-test.frm-eng.frm.eng 	| 30.0 	| 0.457 |
177
| Tatoeba-test.frr-eng.frr.eng 	| 15.2 	| 0.301 |
178
| Tatoeba-test.fry-eng.fry.eng 	| 34.4 	| 0.525 |
179
| Tatoeba-test.gcf-eng.gcf.eng 	| 18.4 	| 0.317 |
180
| Tatoeba-test.gla-eng.gla.eng 	| 24.1 	| 0.400 |
181
| Tatoeba-test.gle-eng.gle.eng 	| 52.2 	| 0.671 |
182
| Tatoeba-test.glg-eng.glg.eng 	| 50.5 	| 0.669 |
183
| Tatoeba-test.glv-eng.glv.eng 	| 5.7 	| 0.189 |
184
| Tatoeba-test.gos-eng.gos.eng 	| 19.2 	| 0.378 |
185
| Tatoeba-test.got-eng.got.eng 	| 0.1 	| 0.022 |
186
| Tatoeba-test.grc-eng.grc.eng 	| 0.9 	| 0.095 |
187
| Tatoeba-test.gsw-eng.gsw.eng 	| 23.9 	| 0.390 |
188
| Tatoeba-test.guj-eng.guj.eng 	| 28.0 	| 0.428 |
189
| Tatoeba-test.hat-eng.hat.eng 	| 44.2 	| 0.567 |
190
| Tatoeba-test.hbs-eng.hbs.eng 	| 51.6 	| 0.666 |
191
| Tatoeba-test.hif-eng.hif.eng 	| 22.3 	| 0.451 |
192
| Tatoeba-test.hin-eng.hin.eng 	| 41.7 	| 0.585 |
193
| Tatoeba-test.hsb-eng.hsb.eng 	| 46.4 	| 0.590 |
194
| Tatoeba-test.hye-eng.hye.eng 	| 40.4 	| 0.564 |
195
| Tatoeba-test.isl-eng.isl.eng 	| 43.8 	| 0.605 |
196
| Tatoeba-test.ita-eng.ita.eng 	| 60.7 	| 0.735 |
197
| Tatoeba-test.jdt-eng.jdt.eng 	| 5.5 	| 0.091 |
198
| Tatoeba-test.kok-eng.kok.eng 	| 7.8 	| 0.205 |
199
| Tatoeba-test.ksh-eng.ksh.eng 	| 15.8 	| 0.284 |
200
| Tatoeba-test.kur-eng.kur.eng 	| 11.6 	| 0.232 |
201
| Tatoeba-test.lad-eng.lad.eng 	| 30.7 	| 0.484 |
202
| Tatoeba-test.lah-eng.lah.eng 	| 11.0 	| 0.286 |
203
| Tatoeba-test.lat-eng.lat.eng 	| 24.4 	| 0.432 |
204
| Tatoeba-test.lav-eng.lav.eng 	| 47.2 	| 0.646 |
205
| Tatoeba-test.lij-eng.lij.eng 	| 9.0 	| 0.287 |
206
| Tatoeba-test.lit-eng.lit.eng 	| 51.7 	| 0.670 |
207
| Tatoeba-test.lld-eng.lld.eng 	| 22.4 	| 0.369 |
208
| Tatoeba-test.lmo-eng.lmo.eng 	| 26.1 	| 0.381 |
209
| Tatoeba-test.ltz-eng.ltz.eng 	| 39.8 	| 0.536 |
210
| Tatoeba-test.mai-eng.mai.eng 	| 72.3 	| 0.758 |
211
| Tatoeba-test.mar-eng.mar.eng 	| 32.0 	| 0.554 |
212
| Tatoeba-test.mfe-eng.mfe.eng 	| 63.1 	| 0.822 |
213
| Tatoeba-test.mkd-eng.mkd.eng 	| 49.5 	| 0.638 |
214
| Tatoeba-test.msa-eng.msa.eng 	| 38.6 	| 0.566 |
215
| Tatoeba-test.multi.eng 	| 45.6 	| 0.615 |
216
| Tatoeba-test.mwl-eng.mwl.eng 	| 40.4 	| 0.767 |
217
| Tatoeba-test.nds-eng.nds.eng 	| 35.5 	| 0.538 |
218
| Tatoeba-test.nep-eng.nep.eng 	| 4.9 	| 0.209 |
219
| Tatoeba-test.nld-eng.nld.eng 	| 54.2 	| 0.694 |
220
| Tatoeba-test.non-eng.non.eng 	| 39.3 	| 0.573 |
221
| Tatoeba-test.nor-eng.nor.eng 	| 50.9 	| 0.663 |
222
| Tatoeba-test.oci-eng.oci.eng 	| 19.6 	| 0.386 |
223
| Tatoeba-test.ori-eng.ori.eng 	| 16.2 	| 0.364 |
224
| Tatoeba-test.orv-eng.orv.eng 	| 13.6 	| 0.288 |
225
| Tatoeba-test.oss-eng.oss.eng 	| 9.4 	| 0.301 |
226
| Tatoeba-test.pan-eng.pan.eng 	| 17.1 	| 0.389 |
227
| Tatoeba-test.pap-eng.pap.eng 	| 57.0 	| 0.680 |
228
| Tatoeba-test.pdc-eng.pdc.eng 	| 41.6 	| 0.526 |
229
| Tatoeba-test.pms-eng.pms.eng 	| 13.7 	| 0.333 |
230
| Tatoeba-test.pol-eng.pol.eng 	| 46.5 	| 0.632 |
231
| Tatoeba-test.por-eng.por.eng 	| 56.4 	| 0.710 |
232
| Tatoeba-test.prg-eng.prg.eng 	| 2.3 	| 0.193 |
233
| Tatoeba-test.pus-eng.pus.eng 	| 3.2 	| 0.194 |
234
| Tatoeba-test.roh-eng.roh.eng 	| 17.5 	| 0.420 |
235
| Tatoeba-test.rom-eng.rom.eng 	| 5.0 	| 0.237 |
236
| Tatoeba-test.ron-eng.ron.eng 	| 51.4 	| 0.670 |
237
| Tatoeba-test.rue-eng.rue.eng 	| 26.0 	| 0.447 |
238
| Tatoeba-test.rus-eng.rus.eng 	| 47.8 	| 0.634 |
239
| Tatoeba-test.san-eng.san.eng 	| 4.0 	| 0.195 |
240
| Tatoeba-test.scn-eng.scn.eng 	| 45.1 	| 0.440 |
241
| Tatoeba-test.sco-eng.sco.eng 	| 41.9 	| 0.582 |
242
| Tatoeba-test.sgs-eng.sgs.eng 	| 38.7 	| 0.498 |
243
| Tatoeba-test.sin-eng.sin.eng 	| 29.7 	| 0.499 |
244
| Tatoeba-test.slv-eng.slv.eng 	| 38.2 	| 0.564 |
245
| Tatoeba-test.snd-eng.snd.eng 	| 12.7 	| 0.342 |
246
| Tatoeba-test.spa-eng.spa.eng 	| 53.2 	| 0.687 |
247
| Tatoeba-test.sqi-eng.sqi.eng 	| 51.9 	| 0.679 |
248
| Tatoeba-test.stq-eng.stq.eng 	| 9.0 	| 0.391 |
249
| Tatoeba-test.swe-eng.swe.eng 	| 57.4 	| 0.705 |
250
| Tatoeba-test.swg-eng.swg.eng 	| 18.0 	| 0.338 |
251
| Tatoeba-test.tgk-eng.tgk.eng 	| 24.3 	| 0.413 |
252
| Tatoeba-test.tly-eng.tly.eng 	| 1.1 	| 0.094 |
253
| Tatoeba-test.ukr-eng.ukr.eng 	| 48.0 	| 0.639 |
254
| Tatoeba-test.urd-eng.urd.eng 	| 27.2 	| 0.471 |
255
| Tatoeba-test.vec-eng.vec.eng 	| 28.0 	| 0.398 |
256
| Tatoeba-test.wln-eng.wln.eng 	| 17.5 	| 0.320 |
257
| Tatoeba-test.yid-eng.yid.eng 	| 26.9 	| 0.457 |
258
| Tatoeba-test.zza-eng.zza.eng 	| 1.7 	| 0.131 |
259
260
261
### System Info: 
262
- hf_name: ine-eng
263
264
- source_languages: ine
265
266
- target_languages: eng
267
268
- opus_readme_url: https://github.com/Helsinki-NLP/Tatoeba-Challenge/tree/master/models/ine-eng/README.md
269
270
- original_repo: Tatoeba-Challenge
271
272
- tags: ['translation']
273
274
- languages: ['ca', 'es', 'os', 'ro', 'fy', 'cy', 'sc', 'is', 'yi', 'lb', 'an', 'sq', 'fr', 'ht', 'rm', 'ps', 'af', 'uk', 'sl', 'lt', 'bg', 'be', 'gd', 'si', 'en', 'br', 'mk', 'or', 'mr', 'ru', 'fo', 'co', 'oc', 'pl', 'gl', 'nb', 'bn', 'id', 'hy', 'da', 'gv', 'nl', 'pt', 'hi', 'as', 'kw', 'ga', 'sv', 'gu', 'wa', 'lv', 'el', 'it', 'hr', 'ur', 'nn', 'de', 'cs', 'ine']
275
276
- src_constituents: {'cat', 'spa', 'pap', 'mwl', 'lij', 'bos_Latn', 'lad_Latn', 'lat_Latn', 'pcd', 'oss', 'ron', 'fry', 'cym', 'awa', 'swg', 'zsm_Latn', 'srd', 'gcf_Latn', 'isl', 'yid', 'bho', 'ltz', 'kur_Latn', 'arg', 'pes_Thaa', 'sqi', 'csb_Latn', 'fra', 'hat', 'non_Latn', 'sco', 'pnb', 'roh', 'bul_Latn', 'pus', 'afr', 'ukr', 'slv', 'lit', 'tmw_Latn', 'hsb', 'tly_Latn', 'bul', 'bel', 'got_Goth', 'lat_Grek', 'ext', 'gla', 'mai', 'sin', 'hif_Latn', 'eng', 'bre', 'nob_Hebr', 'prg_Latn', 'ang_Latn', 'aln', 'mkd', 'ori', 'mar', 'afr_Arab', 'san_Deva', 'gos', 'rus', 'fao', 'orv_Cyrl', 'bel_Latn', 'cos', 'zza', 'grc_Grek', 'oci', 'mfe', 'gom', 'bjn', 'sgs', 'tgk_Cyrl', 'hye_Latn', 'pdc', 'srp_Cyrl', 'pol', 'ast', 'glg', 'pms', 'nob', 'ben', 'min', 'srp_Latn', 'zlm_Latn', 'ind', 'rom', 'hye', 'scn', 'enm_Latn', 'lmo', 'npi', 'pes', 'dan', 'rus_Latn', 'jdt_Cyrl', 'gsw', 'glv', 'nld', 'snd_Arab', 'kur_Arab', 'por', 'hin', 'dsb', 'asm', 'lad', 'frm_Latn', 'ksh', 'pan_Guru', 'cor', 'gle', 'swe', 'guj', 'wln', 'lav', 'ell', 'frr', 'rue', 'ita', 'hrv', 'urd', 'stq', 'nno', 'deu', 'lld_Latn', 'ces', 'egl', 'vec', 'max_Latn', 'pes_Latn', 'ltg', 'nds'}
277
278
- tgt_constituents: {'eng'}
279
280
- src_multilingual: True
281
282
- tgt_multilingual: False
283
284
- prepro:  normalization + SentencePiece (spm32k,spm32k)
285
286
- url_model: https://object.pouta.csc.fi/Tatoeba-MT-models/ine-eng/opus2m-2020-08-01.zip
287
288
- url_test_set: https://object.pouta.csc.fi/Tatoeba-MT-models/ine-eng/opus2m-2020-08-01.test.txt
289
290
- src_alpha3: ine
291
292
- tgt_alpha3: eng
293
294
- short_pair: ine-en
295
296
- chrF2_score: 0.615
297
298
- bleu: 45.6
299
300
- brevity_penalty: 0.997
301
302
- ref_len: 71872.0
303
304
- src_name: Indo-European languages
305
306
- tgt_name: English
307
308
- train_date: 2020-08-01
309
310
- src_alpha2: ine
311
312
- tgt_alpha2: en
313
314
- prefer_old: False
315
316
- long_pair: ine-eng
317
318
- helsinki_git_sha: 480fcbe0ee1bf4774bcbe6226ad9f58e63f6c535
319
320
- transformers_git_sha: 2207e5d8cb224e954a7cba69fa4ac2309e9ff30b
321
322
- port_machine: brutasse
323
324
- port_time: 2020-08-21-14:41