Commit 2a59b71 by jonathansampson (1 parent: e315c05)
addresses minor typos, and other grammar/formatting items
app.py
CHANGED
@@ -29,9 +29,9 @@ Huge thanks to [Tonic](https://huggingface.co/Tonic) who helped build this Space
 
 ### How to Use It
 
-Write
-Optionally you can upload a speech sample or give it a file URL to clone an existing voice. Check out the
-examples at the bottom of
 """
 
 footer = """
@@ -52,15 +52,14 @@ pipe.generate_to_file("output.wav", "Hello from WhisperSpeech.")
 ```
 """
 
-
 text_examples = [
-    ["This is the first demo of Whisper Speech, a fully open source text-to-speech model trained by Collabora and
-    ["World War II or the Second World War was a global conflict that lasted from 1939 to 1945. The vast majority of the world's countries, including all the great powers, fought as part of two opposing military alliances: the Allies and the Axis.", "https://upload.wikimedia.org/wikipedia/commons/7/75/Winston_Churchill_-_Be_Ye_Men_of_Valour.ogg"],
     ["<pl>To jest pierwszy test wielojęzycznego <en>Whisper Speech <pl>, modelu zamieniającego tekst na mowę, który Collabora i Laion nauczyli na superkomputerze <en>Jewels.", None],
-    ["<en>
-    # ["<de>
     ["<pl>To jest pierwszy test naszego modelu. Pozdrawiamy serdecznie.", None],
-    # ["<en>
 ]
 
 def parse_multilingual_text(input_text):
@@ -109,14 +108,14 @@ with gr.Blocks() as demo:
         with gr.Column(scale=2):
             text_input = gr.Textbox(label="Enter multilingual text💬📝",
                                     value=text_examples[0][0],
-                                    info="You can use `<en>` for English and `<pl>` for Polish
             cps = gr.Slider(value=14, minimum=10, maximum=15, step=.25,
                             label="Tempo (in characters per second)")
             with gr.Row(equal_height=True):
                 speaker_input = gr.Audio(label="Upload or Record Speaker Audio (optional)🌬️💬",
                                          sources=["upload", "microphone"],
                                          type='filepath')
-                url_input = gr.Textbox(label="
             gr.Markdown(" \n ") # fixes the bottom overflow from Audio
             generate_button = gr.Button("Try Collabora's WhisperSpeech🌟")
         with gr.Column(scale=1):
@@ -29,9 +29,9 @@ Huge thanks to [Tonic](https://huggingface.co/Tonic) who helped build this Space
 
 ### How to Use It
 
+Write your text in the box—you can use language tags (`<en>` or `<pl>`) to create multilingual speech.
+Optionally you can upload a speech sample, or give it a file URL to clone an existing voice. Check out the
+examples at the bottom of this page for inspiration.
 """
 
 footer = """
@@ -52,15 +52,14 @@ pipe.generate_to_file("output.wav", "Hello from WhisperSpeech.")
 ```
 """
 
 text_examples = [
+    ["This is the first demo of Whisper Speech, a fully open source text-to-speech model trained by Collabora and LAION on the Juwels supercomputer.", None],
+    ["World War II, or the Second World War, was a global conflict that lasted from 1939 to 1945. The vast majority of the world's countries, including all the great powers, fought as part of two opposing military alliances: the Allies and the Axis.", "https://upload.wikimedia.org/wikipedia/commons/7/75/Winston_Churchill_-_Be_Ye_Men_of_Valour.ogg"],
     ["<pl>To jest pierwszy test wielojęzycznego <en>Whisper Speech <pl>, modelu zamieniającego tekst na mowę, który Collabora i Laion nauczyli na superkomputerze <en>Jewels.", None],
+    ["<en>WhisperSpeech is an Open Source library that helps you convert text to speech. <pl>Teraz także po Polsku! <en>I think I just tried saying \"now also in Polish\", don't judge me…", None],
+    # ["<de>WhisperSpeech is multi-lingual <es> y puede cambiar de idioma <hi> मध्य वाक्य में"],
     ["<pl>To jest pierwszy test naszego modelu. Pozdrawiamy serdecznie.", None],
+    # ["<en>The big difference between Europe <fr> et les Etats Unis <pl> jest to, że mamy tak wiele języków <uk> тут, в Європі"]
 ]
 
 def parse_multilingual_text(input_text):
@@ -109,14 +108,14 @@ with gr.Blocks() as demo:
         with gr.Column(scale=2):
             text_input = gr.Textbox(label="Enter multilingual text💬📝",
                                     value=text_examples[0][0],
+                                    info="You can use `<en>` for English and `<pl>` for Polish. See examples below.")
             cps = gr.Slider(value=14, minimum=10, maximum=15, step=.25,
                             label="Tempo (in characters per second)")
             with gr.Row(equal_height=True):
                 speaker_input = gr.Audio(label="Upload or Record Speaker Audio (optional)🌬️💬",
                                          sources=["upload", "microphone"],
                                          type='filepath')
+                url_input = gr.Textbox(label="Alternatively, you can paste in an audio file URL:")
             gr.Markdown(" \n ") # fixes the bottom overflow from Audio
             generate_button = gr.Button("Try Collabora's WhisperSpeech🌟")
         with gr.Column(scale=1):
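
For readers unfamiliar with the `<en>`/`<pl>` tags that the usage text and `text_examples` rely on, the sketch below shows one way such tagged input could be split into per-language segments. It is not the Space's actual `parse_multilingual_text`, whose body does not appear in this diff; the helper name `split_by_language_tag`, the regular expression, and the default language `"en"` are all assumptions made for illustration.

```python
import re

# Hypothetical pattern: a language tag is any word wrapped in angle brackets,
# e.g. <en> or <pl>. The real app may accept a narrower set of tags.
TAG_PATTERN = re.compile(r"<(\w+)>")


def split_by_language_tag(text, default_lang="en"):
    """Split text into (language, segment) pairs at tags such as <en> or <pl>.

    Text that appears before the first tag is assigned default_lang
    (an assumption of this sketch, not documented behavior of the Space).
    """
    segments = []
    lang = default_lang
    pos = 0
    for match in TAG_PATTERN.finditer(text):
        # Everything between the previous tag and this one belongs to `lang`.
        chunk = text[pos:match.start()].strip()
        if chunk:
            segments.append((lang, chunk))
        lang = match.group(1)
        pos = match.end()
    # Remaining text after the last tag.
    tail = text[pos:].strip()
    if tail:
        segments.append((lang, tail))
    return segments


print(split_by_language_tag("<en>Hello <pl>Cześć"))
# [('en', 'Hello'), ('pl', 'Cześć')]
```

Each resulting pair could then be handed to the speech pipeline one language at a time; how the Space actually combines the segments is not visible in this commit.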