WhisperSpeech

Runtime error

App Files Files Community

jonathansampson commited on Feb 25, 2024

Commit

2a59b71

verified ·

1 Parent(s): e315c05

addresses minor typos, and other grammar/formatting items

Browse files

Files changed (1) hide show

app.py +10 -11

app.py CHANGED Viewed

@@ -29,9 +29,9 @@ Huge thanks to [Tonic](https://huggingface.co/Tonic) who helped build this Space
 ### How to Use It
-Write you text in the box, you can use language tags (`<en>` or `<pl>`) to create multilingual speech.
-Optionally you can upload a speech sample or give it a file URL to clone an existing voice. Check out the
-examples at the bottom of the page for inspiration.
 """
 footer = """
@@ -52,15 +52,14 @@ pipe.generate_to_file("output.wav", "Hello from WhisperSpeech.")
 ```
 """
 text_examples = [
-    ["This is the first demo of Whisper Speech, a fully open source text-to-speech model trained by Collabora and Lion on the Juwels supercomputer.", None],
-    ["World War II or the Second World War was a global conflict that lasted from 1939 to 1945. The vast majority of the world's countries, including all the great powers, fought as part of two opposing military alliances: the Allies and the Axis.", "https://upload.wikimedia.org/wikipedia/commons/7/75/Winston_Churchill_-_Be_Ye_Men_of_Valour.ogg"],
     ["<pl>To jest pierwszy test wielojęzycznego <en>Whisper Speech <pl>, modelu zamieniającego tekst na mowę, który Collabora i Laion nauczyli na superkomputerze <en>Jewels.", None],
-    ["<en> WhisperSpeech is an Open Source library that helps you convert text to speech. <pl>Teraz także po Polsku! <en>I think I just tried saying \"now also in Polish\", don't judge me...", None],
-    # ["<de> WhisperSpeech is multi-lingual <es> y puede cambiar de idioma <hi> मध्य वाक्य में"],
     ["<pl>To jest pierwszy test naszego modelu. Pozdrawiamy serdecznie.", None],
-    # ["<en> The big difference between Europe <fr> et les Etats Unis <pl> jest to, że mamy tak wiele języków <uk> тут, в Європі"]
 ]
 def parse_multilingual_text(input_text):
@@ -109,14 +108,14 @@ with gr.Blocks() as demo:
         with gr.Column(scale=2):
             text_input = gr.Textbox(label="Enter multilingual text💬📝",
                                     value=text_examples[0][0],
-                                    info="You can use `<en>` for English and `<pl>` for Polish, see examples below.")
             cps = gr.Slider(value=14, minimum=10, maximum=15, step=.25,
                             label="Tempo (in characters per second)")
             with gr.Row(equal_height=True):
                 speaker_input = gr.Audio(label="Upload or Record Speaker Audio (optional)🌬️💬",
                                      sources=["upload", "microphone"],
                                      type='filepath')
-                url_input = gr.Textbox(label="alternatively, you can paste in an audio file URL:")
             gr.Markdown("  \n  ") # fixes the bottom overflow from Audio
             generate_button = gr.Button("Try Collabora's WhisperSpeech🌟")
         with gr.Column(scale=1):

 ### How to Use It
+Write your text in the box—you can use language tags (`<en>` or `<pl>`) to create multilingual speech.
+Optionally you can upload a speech sample, or give it a file URL to clone an existing voice. Check out the
+examples at the bottom of this page for inspiration.
 """
 footer = """
 ```
 """
 text_examples = [
+    ["This is the first demo of Whisper Speech, a fully open source text-to-speech model trained by Collabora and LAION on the Juwels supercomputer.", None],
+    ["World War II, or the Second World War, was a global conflict that lasted from 1939 to 1945. The vast majority of the world's countries, including all the great powers, fought as part of two opposing military alliances: the Allies and the Axis.", "https://upload.wikimedia.org/wikipedia/commons/7/75/Winston_Churchill_-_Be_Ye_Men_of_Valour.ogg"],
     ["<pl>To jest pierwszy test wielojęzycznego <en>Whisper Speech <pl>, modelu zamieniającego tekst na mowę, który Collabora i Laion nauczyli na superkomputerze <en>Jewels.", None],
+    ["<en>WhisperSpeech is an Open Source library that helps you convert text to speech. <pl>Teraz także po Polsku! <en>I think I just tried saying \"now also in Polish\", don't judge me…", None],
+    # ["<de>WhisperSpeech is multi-lingual <es> y puede cambiar de idioma <hi> मध्य वाक्य में"],
     ["<pl>To jest pierwszy test naszego modelu. Pozdrawiamy serdecznie.", None],
+    # ["<en>The big difference between Europe <fr> et les Etats Unis <pl> jest to, że mamy tak wiele języków <uk> тут, в Європі"]
 ]
 def parse_multilingual_text(input_text):
         with gr.Column(scale=2):
             text_input = gr.Textbox(label="Enter multilingual text💬📝",
                                     value=text_examples[0][0],
+                                    info="You can use `<en>` for English and `<pl>` for Polish. See examples below.")
             cps = gr.Slider(value=14, minimum=10, maximum=15, step=.25,
                             label="Tempo (in characters per second)")
             with gr.Row(equal_height=True):
                 speaker_input = gr.Audio(label="Upload or Record Speaker Audio (optional)🌬️💬",
                                      sources=["upload", "microphone"],
                                      type='filepath')
+                url_input = gr.Textbox(label="Alternatively, you can paste in an audio file URL:")
             gr.Markdown("  \n  ") # fixes the bottom overflow from Audio
             generate_button = gr.Button("Try Collabora's WhisperSpeech🌟")
         with gr.Column(scale=1):