Joycent ParallelWaveGAN Vocoder

This repository stores the ParallelWaveGAN vocoder used by Joycent Mandarin accent text-to-speech inference.

The model generates 16 kHz audio from 80-bin mel spectrograms. Keep checkpoint-50000steps.pkl and config.yml in the same directory when loading the model with ParallelWaveGAN:

import yaml
from parallel_wavegan.utils import load_model

with open("config.yml", encoding="utf-8") as file:
    config = yaml.load(file, Loader=yaml.Loader)

vocoder = load_model("checkpoint-50000steps.pkl", config)
vocoder.remove_weight_norm()
vocoder.eval()

The Joycent implementation and inference instructions are available in the Joycent repository.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Space using walston/joycent-vocoder 1