---
language:
- 'en'  
- 'me' 
license: afl-3.0  
tags:
- 'translation'
datasets:
- 'Qilex/EN-ME' 
metrics:
- bleu  

model-index:
- name: 'en-me'
  results:
  - task:
      type: 'translation'          
      name: 'translation en-me'           
    dataset:
      type: 'translation'
      name: 'Qilex/EN-ME'      
    
    metrics:
      - type: 'bleu'         
        value: 17.2    
---

This is a BART-large model fine-tuned on roughly 58,000 aligned sentence pairs in English and Middle English, collected from the works of Geoffrey Chaucer, John Wycliffe, and the Gawain Poet.
<br>
It includes special characters such as þ. 
<br>
This model reflects the spelling inconsistencies characteristic of Middle English.
<br>
Because the model was trained largely on poetry, with some prose, it performs best on those kinds of text.
<br>
Performance improves when the input is split into sentences and translated one sentence at a time.
<br>
Expanding contractions (hadn't -> had not) before translation also improves performance.
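
The two preprocessing tips above can be sketched in plain Python. This is only an illustration, not the pipeline used to train or evaluate the model: the contraction table is a small hand-picked sample, the sentence splitter is a naive regex (a proper sentence tokenizer would handle abbreviations better), and `translate_sentence` is a hypothetical callable standing in for whatever code runs this model on a single sentence.

```python
import re
from typing import Callable, List

# Irregular contractions that the generic "n't" rule would get wrong.
# Illustrative sample only, not an exhaustive list.
IRREGULAR = {"can't": "cannot", "won't": "will not", "shan't": "shall not"}
_IRREGULAR_RE = re.compile(
    r"\b(" + "|".join(re.escape(k) for k in IRREGULAR) + r")\b", re.IGNORECASE
)


def _swap_irregular(match: "re.Match[str]") -> str:
    word = match.group(0)
    full = IRREGULAR[word.lower()]
    # Keep a leading capital if the contraction had one.
    return full.capitalize() if word[0].isupper() else full


def expand_contractions(text: str) -> str:
    """Rewrite negative contractions, e.g. "hadn't" -> "had not"."""
    text = text.replace("\u2019", "'")  # normalize typographic apostrophes
    text = _IRREGULAR_RE.sub(_swap_irregular, text)
    return re.sub(r"(\w+)n't\b", r"\1 not", text)


def split_sentences(text: str) -> List[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def prepare(text: str) -> List[str]:
    """Expand contractions, then return one cleaned sentence per item."""
    return split_sentences(expand_contractions(text))


def translate_text(text: str, translate_sentence: Callable[[str], str]) -> List[str]:
    """Translate sentence by sentence using the supplied callable."""
    return [translate_sentence(sentence) for sentence in prepare(text)]
```

For example, `prepare("He hadn't come. She won't wait!")` yields two cleaned sentences, each of which can then be passed to the model separately.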