Jordan Myers committed on
Commit 340a52c • 1 Parent(s): 66724ca
description update
README.md
CHANGED
@@ -1,8 +1,8 @@
 ---
-title: English
-emoji:
+title: English / toki pona Translator
+emoji: 💬
 colorFrom: indigo
-colorTo:
+colorTo: purple
 sdk: gradio
 sdk_version: 3.15.0
 app_file: app.py
app.py
CHANGED
@@ -54,10 +54,42 @@ app = gr.Interface(
 ["Its not always perfect, but its pretty okay!", "English", "toki pona", 3],
 ["ilo pi ante toki ni li pona a!", "toki pona", "English", 3],
 ["kijetesantakalu li pona", "toki pona", "English", 3],
+["mi li toki e toki pona", "toki pona", "toki pona", 3]
 ],
 cache_examples=False,
-title="
-description="
+title="English / toki pona Translation",
+description="""
+# A simple English / toki pona Neural Machine Translation App!
+
+### toki a! 💬
+
+This is a simple English to toki pona / toki pona to English neural machine translation app.
+
+Input your text to translate, a source language and target language, and desired number of return sequences!
+
+### Grammaticality / Regularization
+English -> English and/or toki pona -> toki pona will result in some form of regularization.
+
+This can approximate grammaticality, but it isn't always the best.
+
+For example, "mi li toki e toki pona" [src: toki pona, tgt: toki pona] will result in ['mi toki e toki pona.', 'mi toki pona.', 'mi toki e toki pona']
+(Thus, the ungrammatical "li" is dropped)
+
+### Model and Data
+This app utilizes a fine-tuned version of Facebook/Meta AI's M2M100 418M param model.
+
+By leveraging the pretrained weights of the massively multilingual M2M100 model,
+we can jumpstart our transfer learning to accomplish machine translation for toki pona!
+
+The model was fine-tuned on the English/toki pona bitexts found at https://tatoeba.org/
+
+### This app is a work in progress and obviously not all translations will be perfect.
+In addition to parameter quantity and the hyper-parameters used while training,
+the *quality of data* found on Tatoeba directly influences the performance of projects like this!
+
+If you wish to contribute, please simply add high quality and diverse translations to Tatoeba!
+
+"""
 )

 app.launch()
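
Note on the new description: it names four inputs (text, source language, target language, number of return sequences), but this hunk does not show the function that consumes them. The following is a minimal sketch of how such inputs could be handed to an M2M100-style checkpoint through Hugging Face `transformers`. The names `translate` and `generation_kwargs`, the beam floor of 4, and the 64-token limit are assumptions made for illustration and are not taken from this commit. The diff also does not say which language code the fine-tuned checkpoint uses for toki pona, so the caller supplies the codes.

```python
from typing import Any, Dict, List


def generation_kwargs(num_return_sequences: int, max_new_tokens: int = 64) -> Dict[str, int]:
    """Build generate() arguments (hypothetical helper, not from the commit).

    Beam search can only return as many sequences as it has beams,
    so num_beams is kept at or above num_return_sequences.
    """
    if num_return_sequences < 1:
        raise ValueError("num_return_sequences must be at least 1")
    return {
        "num_beams": max(4, num_return_sequences),  # assumed floor of 4 beams
        "num_return_sequences": num_return_sequences,
        "max_new_tokens": max_new_tokens,  # assumed length limit
    }


def translate(
    text: str,
    src_code: str,
    tgt_code: str,
    num_return_sequences: int,
    model: Any,
    tokenizer: Any,
) -> List[str]:
    """Translate text and return the requested number of candidates.

    `model` and `tokenizer` are expected to behave like
    M2M100ForConditionalGeneration and M2M100Tokenizer.
    `src_code` and `tgt_code` must be codes the tokenizer knows;
    the code used for toki pona is checkpoint-specific (assumption).
    """
    tokenizer.src_lang = src_code  # tells the tokenizer which source tag to prepend
    encoded = tokenizer(text, return_tensors="pt")
    generated = model.generate(
        **encoded,
        # Force the first generated token to be the target-language tag.
        forced_bos_token_id=tokenizer.get_lang_id(tgt_code),
        **generation_kwargs(num_return_sequences),
    )
    return tokenizer.batch_decode(generated, skip_special_tokens=True)
```

With real objects, `model` and `tokenizer` would come from `from_pretrained` on the fine-tuned checkpoint, and the Gradio callback would first map the labels "English" and "toki pona" to whatever codes that checkpoint was trained with.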
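
Note on the regularization example: the description says the ungrammatical "li" is dropped from "mi li toki e toki pona". The grammar point behind it is that toki pona omits `li` when the subject is the bare pronoun `mi` or `sina`. A hand-written version of that single rule is sketched here for comparison. It is a rule-based illustration only, not what the app does (the app relies on the fine-tuned model), and the function name `drop_redundant_li` is hypothetical.

```python
def drop_redundant_li(sentence: str) -> str:
    """Remove 'li' when it directly follows a bare 'mi' or 'sina' subject.

    Only this one rule is covered; compound subjects such as
    'mi mute' keep their 'li', as do all other subjects.
    """
    words = sentence.split()
    if len(words) >= 2 and words[0] in {"mi", "sina"} and words[1] == "li":
        del words[1]  # bare pronoun subject: 'li' is not used
    return " ".join(words)


if __name__ == "__main__":
    print(drop_redundant_li("mi li toki e toki pona"))
    print(drop_redundant_li("kijetesantakalu li pona"))
```

A rule like this only handles the one pattern it encodes, whereas the same-language translation described in the diff returns several candidate rewrites, some of which change more than the `li`.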