soujanyaporia committed on
Commit
3d974c0
1 Parent(s): a7ff543

Update app.py

Files changed (1)
  1. app.py +8 -1
app.py CHANGED
@@ -90,7 +90,13 @@ the generations from TANGO for prompts Chopping tomatoes on a wooden table and C
  on a metal table are very similar. \
  <br/><br/>We are currently training another version of TANGO on larger datasets to enhance its generalization, \
  compositional and controllable generation ability.
- <br/><br/>We recommend using a guidance scale of 3. The default number of steps is set to 100. More steps generally lead to better quality of generated audios but will take a longer time.
+ <br/><br/>We recommend using a guidance scale of 3. The default number of steps is set to 100. More steps generally lead to better quality of generated audios but will take longer.
+ <br/><br/>
+ <h1> ChatGPT-enhanced audio generation</h1>
+ <br/>
+ As TANGO consists of an instruction-tuned LLM, it is able to process complex sound descriptions allowing us to provide more detailed instructions to improve the generation quality.
+ For example, ``A boat is moving on the sea'' vs ``The sound of the water lapping against the hull of the boat or splashing as you move through the waves''. The latter is obtained by prompting ChatGPT to explain the sound generated when a boat moves on the sea.
+ Using this ChatGPT-generated description of the sound, TANGO provides superior results.
  <p/>
  """

@@ -109,6 +115,7 @@ gr_interface = gr.Interface(
  description=description_text,
  allow_flagging=False,
  examples=[
+ ["The sound of the water lapping against the hull of the boat or splashing as you move through the waves"],
  ["An audience cheering and clapping"],
  ["Rolling thunder with lightning strikes"],
  ["Gentle water stream, birds chirping and sudden gun shot"],
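Illustrative sketch (not part of this commit): the added description text says the detailed prompt was obtained by asking ChatGPT to explain the sound generated when a boat moves on the sea, and recommends a guidance scale of 3 with 100 steps by default. The snippet below shows one way that workflow could be organised. `GenerationRequest`, `build_chatgpt_query`, and `make_request` are hypothetical names, the exact wording sent to ChatGPT is an assumption, and no call to ChatGPT or TANGO is made here.

```python
from dataclasses import dataclass

# Values taken from the description text in this commit.
RECOMMENDED_GUIDANCE_SCALE = 3
DEFAULT_STEPS = 100


@dataclass(frozen=True)
class GenerationRequest:
    """Hypothetical container for one text-to-audio request."""
    prompt: str
    guidance_scale: float = RECOMMENDED_GUIDANCE_SCALE
    steps: int = DEFAULT_STEPS


def build_chatgpt_query(event: str) -> str:
    """Build the question to send to ChatGPT (the wording is an assumption)."""
    event = event.strip().rstrip(".")
    if not event:
        raise ValueError("event must be a non-empty description")
    return f"Explain the sound generated when {event}."


def make_request(description: str) -> GenerationRequest:
    """Pair a ChatGPT-written sound description with the recommended settings."""
    return GenerationRequest(prompt=description.strip())


# Step 1: ask ChatGPT for a richer description (the reply is pasted in by hand).
query = build_chatgpt_query("a boat moves on the sea")

# Step 2: use the richer description as the prompt instead of the short caption.
request = make_request(
    "The sound of the water lapping against the hull of the boat "
    "or splashing as you move through the waves"
)
```

The `request.prompt` string would then go into the demo's text box in place of the short caption "A boat is moving on the sea"; the commit adds the same string as the first entry in `examples`.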