--- tags: - espnet - audio - audio-to-audio - vocoder language: - en datasets: - vctk license: cc-by-4.0 inference: false --- ## Vocoder model - HifiGAN - English https://github.com/kan-bayashi/ParallelWaveGAN **No support given.** ### Details ``` batch_size: 16 discriminator_params: follow_official_norm: true period_discriminator_params: bias: true channels: 32 downsample_scales: - 3 - 3 - 3 - 3 - 1 in_channels: 1 kernel_sizes: - 5 - 3 max_downsample_channels: 1024 nonlinear_activation: LeakyReLU nonlinear_activation_params: negative_slope: 0.1 out_channels: 1 use_spectral_norm: false use_weight_norm: true periods: - 2 - 3 - 5 - 7 - 11 ```