File size: 2,669 Bytes
9858ea0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
---
language:
- en
- nb
- nn

tags:
- translation

license: apache-2.0
---

### eng-nor

* source language name: English
* target language name: Norwegian
* OPUS readme: [README.md](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-nor/README.md)

* model: transformer-align
* source language code: en
* target language codes: nb, nn
* dataset: opus  with backtranslations
* release date: 2021-04-20
* pre-processing: normalization + SentencePiece (spm32k,spm32k)
* download original weights: [opus+bt-2021-04-20.zip](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-nor/opus+bt-2021-04-20.zip/eng-nor/opus+bt-2021-04-20.zip)
* a sentence-initial language token is required in the form of >>id<<(id = valid, usually three-letter target language ID)
* Training data: 
  * eng-nno: Tatoeba-train (1661769) wikipedia.aa.nno-eng (995603) wikipedia.ab.nno-eng (605107) wikiquote.aa.nno-eng (22626)
  * eng-nob: Tatoeba-train (11525999) wikibooks.aa.nob-eng (37901) wikinews.aa.nob-eng (8706) wikipedia.aa.nob-eng (992563) wikipedia.ab.nob-eng (992772) wikipedia.ac.nob-eng (992621) wikipedia.ad.nob-eng (992828) wikipedia.ae.nob-eng (992812) wikipedia.af.nob-eng (976715) wikiquote.aa.nob-eng (10443) wikisource.aa.nob-eng (279891)
* Validation data: 
  * eng-nno: Tatoeba-dev, 505
  * eng-nob: Tatoeba-dev, 5189
  * total-size-shuffled: 1505
  * devset-selected: top 1505  lines of Tatoeba-dev.src.shuffled
* Test data: 
  * Tatoeba-test.eng-nno: 460/3428
  * Tatoeba-test.eng-nob: 4539/36110
  * Tatoeba-test.eng-nor: 4999/39547
* test set translations file: [test.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-nor/opus+bt-2021-04-20.zip/eng-nor/opus+bt-2021-04-20.test.txt)
* test set scores file: [eval.txt](https://object.pouta.csc.fi/Tatoeba-MT-models/eng-nor/opus+bt-2021-04-20.zip/eng-nor/opus+bt-2021-04-20.eval.txt)
* BLEU-scores
|Test set|score|
|---|---|
|Tatoeba-test.eng-nob|56.4|
|Tatoeba-test.eng-nor|55.4|
|Tatoeba-test.eng-nno|40.3|
* chr-F-scores
|Test set|score|
|---|---|
|Tatoeba-test.eng-nob|0.716|
|Tatoeba-test.eng-nor|0.71|
|Tatoeba-test.eng-nno|0.615|

### System Info: 
* hf_name: eng-nor
* source_languages: en
* target_languages: nb,nn
* opus_readme_url: https://object.pouta.csc.fi/Tatoeba-MT-models/eng-nor/opus+bt-2021-04-20.zip/README.md
* original_repo: Tatoeba-Challenge
* tags: ['translation']
* languages: ['en', 'nb', 'nn']
* src_constituents: ['eng']
* tgt_constituents: ['nob', 'nno']
* src_multilingual: False
* tgt_multilingual: True
* helsinki_git_sha: 59400fea592520766f9910390155681bc930dbc4
* transformers_git_sha: fd5cdaeea6eafac32e9d967327bfa3dc0e0d962d
* port_machine: DESKTOP-6CPR2HH
* port_time: 2023-01-23-21:07