ylacombe commited on
Commit
2652175
β€’
1 Parent(s): b8391d0
Files changed (1) hide show
  1. app.py +2 -2
app.py CHANGED
@@ -13,7 +13,7 @@ device = "cuda:0" if torch.cuda.is_available() else "cpu"
13
 
14
 
15
  repo_id = "parler-tts/parler-tts-mini-v1"
16
- repo_id_large = "ylacombe/parler-large-v1-og"
17
 
18
  model = ParlerTTSForConditionalGeneration.from_pretrained(repo_id).to(device)
19
  model_large = ParlerTTSForConditionalGeneration.from_pretrained(repo_id_large).to(device)
@@ -154,7 +154,7 @@ with gr.Blocks(css=css) as block:
154
  are trained using 45k hours of narrated English audiobooks. It generates high-quality speech
155
  with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).</p>
156
 
157
- <p>By default, Parler-TTS generates 🎲 random voice. To ensure 🎯 <b> speaker consistency </b> across generations, these checkpoints were also trained on 34 speakers, characterized by name (e.g. Jon, Lea, Gary, Jenna, Mike, Laura).</p>
158
 
159
  <p>To take advantage of this, simply adapt your text description to specify which speaker to use: `Jon's voice is monotone...`</p>
160
  """
 
13
 
14
 
15
  repo_id = "parler-tts/parler-tts-mini-v1"
16
+ repo_id_large = "parler-tts/parler-tts-large-v1"
17
 
18
  model = ParlerTTSForConditionalGeneration.from_pretrained(repo_id).to(device)
19
  model_large = ParlerTTSForConditionalGeneration.from_pretrained(repo_id_large).to(device)
 
154
  are trained using 45k hours of narrated English audiobooks. It generates high-quality speech
155
  with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).</p>
156
 
157
+ <p>By default, Parler-TTS generates 🎲 random voice. To ensure 🎯 <b> speaker consistency </b> across generations, these checkpoints were also trained on 34 speakers, characterized by name (e.g. Jon, Lea, Gary, Jenna, Mike, Laura). Learn more about this <a href="https://github.com/huggingface/parler-tts/blob/main/INFERENCE.md#speaker-consistency"> here </a>.</p>
158
 
159
  <p>To take advantage of this, simply adapt your text description to specify which speaker to use: `Jon's voice is monotone...`</p>
160
  """