Spaces:

ReneeYe
/

ConST-speech2text-translator

Build error

ReneeYe commited on May 25, 2022

Commit

17b21b3

•

1 Parent(s): 715d968

change ui

Files changed (3) hide show

app.py CHANGED Viewed

@@ -42,8 +42,6 @@ os.system("git clone https://github.com/ReneeYe/ConST")
 os.system("mv ConST ConST_git")
 os.system('mv -n ConST_git/* ./')
 os.system("rm -rf ConST_git")
-# os.system("python3 setup.py install")
-# os.system("python3 setup.py build_ext --inplace")
 os.system("pip3 install --editable ./")
 os.system("mkdir -p data checkpoint")
@@ -144,12 +142,16 @@ iface = gr.Interface(
     fn=run,
     inputs=inputs,
     outputs=[gr.outputs.Textbox(label="The translation")],
-    examples=[['case1.wav', "German"],['case2.wav', "German"], ['case3.wav', "German"]],
     title="ConST: an end-to-end speech translator",
-    description="End-to-end Speech Translation Live Demo for English to eight European languages.",
-    article="ConST is an end-to-end speech translation model (see paper at https://arxiv.org/abs/2205.02444 ). "
-            "Its motivation is to use contrastive learning method to learn similar representations for semantically similar speech and text.",
-    theme="seafoam",
-    layout='vertical',
 )
 iface.launch()

 os.system("mv ConST ConST_git")
 os.system('mv -n ConST_git/* ./')
 os.system("rm -rf ConST_git")
 os.system("pip3 install --editable ./")
 os.system("mkdir -p data checkpoint")
     fn=run,
     inputs=inputs,
     outputs=[gr.outputs.Textbox(label="The translation")],
+    examples=[['short-case.wav', "German"], ['long-case.wav', "German"]],
     title="ConST: an end-to-end speech translator",
+    description='ConST is an end-to-end speech-to-text translation model, whose algorithm corresponds to the '
+                'NAACL 2022 paper *"Cross-modal Contrastive Learning for Speech Translation"* (see the paper at https://arxiv.org/abs/2205.02444 for more details).'
+                'This is a live demo for ConST, to translate English into eight European languages.',
+    article="- The motivation of the ConST model is to use the contrastive learning method to learn similar representations for semantically similar speech and text, " \
+            "thus leveraging MT to help improve ST performance. \n"
+            "- The models you are experiencing are trained based on the MuST-C dataset (https://ict.fbk.eu/must-c/), " \
+            "which only contains about 250k parallel data at each translation direction. \n"
+            "- If you want to know how to train the models, you may refer to https://github.com/ReneeYe/ConST.",
+    theme="peach",
 )
 iface.launch()

case3.wav → long-case.wav RENAMED Viewed

File without changes

case2.wav → short-case.wav RENAMED Viewed

File without changes