File size: 1,466 Bytes
1cbd547
 
 
 
 
 
 
f34820e
1cbd547
 
 
 
 
 
 
 
 
 
0f3b394
49feea6
0f3b394
1cbd547
 
 
 
 
 
 
 
 
 
3c591f1
1cbd547
23372bf
1cbd547
 
 
 
 
 
 
834056a
1cbd547
 
 
 
 
 
 
 
 
 
c59b7f9
e7ba362
1cbd547
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
---

language: 
  - en
  - hu
tags:
- translation
license: gpl
metrics:
- sacrebleu
- chrf
widget:
- text: "This may not make much sense to you, sir, but I'd like to ask your permission to date your daughter."
  example_title: "Translation: English-Hungarian"

---

# BART Translation model

For further models, scripts and details, see [our repository](https://github.com/nytud/machine-translation) or [our demo site](https://juniper.nytud.hu/demo/nlp).

- Source language: English
- Target language: Hungarian

- BART base model:
  - Pretrained on English WikiText-103 and Hungarian Wikipedia
  - Finetuned on subcorpora from OPUS
  	- Segments: 56.837.602

## Limitations

- tokenized input text (tokenizer: [HuSpaCy](https://huggingface.co/huspacy))
- max_source_length = 128
- max_target_length = 128

## Results

| Model | BLEU | chrF-3 | chrF-6 |
| ------------- | ------------- | ------------- | ------------- |
| Google | 25.30 | 54.09 | 49.0 |
| **BART** | **36.89** | **60.77** | **56.4** |
| mT5 | 27.69  | 53.73 | 48.57 |

## Citation
If you use this model, please cite the following paper:
```
@inproceedings {laki-yang-mt,
    title = {{Jobban fordítunk magyarra, mint a Google!}},
	booktitle = {XVIII. Magyar Számítógépes Nyelvészeti Konferencia},
	year = {2022},
	publisher = {Szegedi Tudományegyetem, Informatikai Intézet},
	address = {Szeged, Magyarország},
	author = {Laki, László and Yang, Zijian Győző},
	pages = {357--372}
}

```