Text-To-Speech-WaveGRU / wavegru.yaml
cowboyuniverse's picture
Duplicate from ntt123/WaveGRU-Text-To-Speech
dd50b29
raw
history blame
No virus
256 Bytes
## dsp
sample_rate : 24000
window_length: 50.0 # ms
hop_length: 12.5 # ms
mel_min: 1.0e-5 ## need .0 to make it a float
mel_dim: 80
n_fft: 2048
## wavegru
embed_dim: 32
rnn_dim: 1024
frames_per_sequence: 67
num_pad_frames: 62
upsample_factors: [5, 3, 20]