tenebrisu committed on
Commit
e1968fb
1 Parent(s): 454658c

Update README.md

Files changed (1)
  1. README.md +6 -1
README.md CHANGED
@@ -3,12 +3,15 @@ datasets:
  - common_voice
  language:
  - uk
+ license: mit
+ base_mode: microsoft/speecht5_tts
+ pipeline_tag: text-to-speech
  ---
  This model is a fine-tuned version of SpeechT5 for the Ukrainian language, using the Common Voice dataset.

  ## Usage:
  ```python
- !pip install git+https://github.com/huggingface/transformers sentencepiece datasets
+ !pip install git+https://github.com/huggingface/transformers

  from transformers import SpeechT5Processor, SpeechT5ForTextToSpeech, SpeechT5ForSpeechToSpeech, SpeechT5HifiGan

@@ -19,6 +22,8 @@ vocoder = SpeechT5HifiGan.from_pretrained("microsoft/speecht5_hifigan")
  speaker_embeddings = 2 * torch.rand((1, 512)) - 1
  text = """ pryvit yak spravy """

+ inputs = processor(text=text, return_tensors="pt")
+
  speech = model.generate_speech(inputs["input_ids"], speaker_embeddings, output_cross_attentions=True)
  waveform = vocoder.forward(speech[0])
  ```
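
The two hunks skip the README lines that load the processor and model, so the snippet cannot be run from the diff alone. The following is a minimal sketch of how the pieces might fit together, not the author's exact code. The function name `synthesize` and its `model_id` and `seed` parameters are hypothetical; `model_id` stands in for this repository's Hub id, which the diff does not show, and the sketch assumes the repository also ships the processor files. The vocoder id is taken from the second hunk header.

```python
def synthesize(text, model_id, vocoder_id="microsoft/speecht5_hifigan", seed=0):
    """Return a 1-D waveform tensor for `text` (SpeechT5 produces 16 kHz audio)."""
    # Heavy dependencies are imported lazily, so defining the function is cheap.
    import torch
    from transformers import SpeechT5ForTextToSpeech, SpeechT5HifiGan, SpeechT5Processor

    # Assumption: model_id is this repository's Hub id (not visible in the diff).
    processor = SpeechT5Processor.from_pretrained(model_id)
    model = SpeechT5ForTextToSpeech.from_pretrained(model_id)
    vocoder = SpeechT5HifiGan.from_pretrained(vocoder_id)

    # Same random speaker embedding as the README: uniform in [-1, 1), shape (1, 512).
    # Seeding is an addition here so repeated calls produce the same voice.
    torch.manual_seed(seed)
    speaker_embeddings = 2 * torch.rand((1, 512)) - 1

    # Tokenize the text; this is the step the commit adds to the README.
    inputs = processor(text=text, return_tensors="pt")

    with torch.no_grad():
        # Without a vocoder argument, generate_speech returns a mel spectrogram.
        spectrogram = model.generate_speech(inputs["input_ids"], speaker_embeddings)
        # The HiFi-GAN vocoder converts the spectrogram to a waveform.
        waveform = vocoder(spectrogram)
    return waveform
```

A call such as `synthesize(" pryvit yak spravy ", model_id=...)` would then return the waveform, with `model_id` filled in by the reader. Random speaker embeddings are kept only to mirror the README; a real x-vector will likely give a more natural voice.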