esnya commited on
Commit
8b0c89d
1 Parent(s): 2ab9d6a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -2
README.md CHANGED
@@ -6,12 +6,18 @@ tags:
6
  - jvs
7
  - pyopenjtalk
8
  - speech-to-text
 
9
  ---
10
 
11
  # SpeechT5 (TTS task) for Japanese
12
- SpeechT5 model fine-tuned for speech synthesis (text-to-speech) on [JVS]("https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus").
 
 
 
 
13
  Trained from [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts).
14
- Modified tokenizer powered by [Open Jtalk](https://open-jtalk.sp.nitech.ac.jp/)
 
15
 
16
  # Model description
17
  See [original model card](https://huggingface.co/microsoft/speecht5_tts#model-description)
 
6
  - jvs
7
  - pyopenjtalk
8
  - speech-to-text
9
+ pipeline_tag: text-to-speech
10
  ---
11
 
12
  # SpeechT5 (TTS task) for Japanese
13
+ SpeechT5 model fine-tuned for Japanese speech synthesis (text-to-speech) on [JVS]("https://sites.google.com/site/shinnosuketakamichi/research-topics/jvs_corpus").
14
+ This model utilizes the JVS dataset which encompasses 100 speakers.
15
+ From this dataset, speaker embeddings were crafted, segregating them based on male and female voice types, and producing a unique speaker embedding vector.
16
+ This 16-dimensional speaker embedding vector is designed with an aim to provide a voice quality that is independent of any specific speaker.
17
+
18
  Trained from [microsoft/speecht5_tts](https://huggingface.co/microsoft/speecht5_tts).
19
+ Modified tokenizer powered by [Open Jtalk](https://open-jtalk.sp.nitech.ac.jp/).
20
+
21
 
22
  # Model description
23
  See [original model card](https://huggingface.co/microsoft/speecht5_tts#model-description)