Joycent ParallelWaveGAN Vocoder
This repository stores the ParallelWaveGAN vocoder used by Joycent Mandarin accent text-to-speech inference.
The model generates 16 kHz audio from 80-bin mel spectrograms. Keep
checkpoint-50000steps.pkl and config.yml in the same directory when loading
the model with ParallelWaveGAN:
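import yaml
from parallel_wavegan.utils import load_model

# Read the model configuration stored next to the checkpoint.
with open("config.yml", encoding="utf-8") as file:
    config = yaml.load(file, Loader=yaml.Loader)

# Build the generator from the configuration and load the checkpoint weights.
vocoder = load_model("checkpoint-50000steps.pkl", config)
# Strip weight normalization and switch to evaluation mode for inference.
vocoder.remove_weight_norm()
vocoder.eval()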
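Because the checkpoint and its configuration are expected to sit side by side, it can help to verify that both files are present before loading. The following is a minimal sketch using only the Python standard library. The helper name `find_vocoder_files` and its `model_dir` argument are hypothetical and are not part of ParallelWaveGAN or the Joycent code; only the two file names come from this README.

```python
from pathlib import Path

# File names as given in this README.
CHECKPOINT_NAME = "checkpoint-50000steps.pkl"
CONFIG_NAME = "config.yml"


def find_vocoder_files(model_dir="."):
    """Return (checkpoint_path, config_path) if both files exist in model_dir.

    Hypothetical helper for illustration only. Raises FileNotFoundError
    naming whichever of the two files is missing.
    """
    directory = Path(model_dir)
    checkpoint_path = directory / CHECKPOINT_NAME
    config_path = directory / CONFIG_NAME

    # Collect the names of any files that are not present as regular files.
    missing = [p.name for p in (checkpoint_path, config_path) if not p.is_file()]
    if missing:
        raise FileNotFoundError(f"Missing in {directory}: {', '.join(missing)}")
    return checkpoint_path, config_path
```

The two returned paths can then be passed to `open(...)` and `load_model(...)` in place of the bare file names.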
The Joycent implementation and inference instructions are available in the Joycent repository.