Edit model card

Kotoba-Speech-v0.1

Kotoba-Speech v0.1 is a 1.2B Transformer-based speech generative model. It supports the following properties:

  1. Fluent text-to-speech generation in Japanese
  2. One-shot voice cloning through speech prompt

logo

Usage

Plesae check out our HF Spaces demo.

Model Details

  • Model type: Our model is end-to-end transformers.
  • Language(s): Japanese
  • Library: We'll releasde our training code soon. Inference and model code are largely adopted from metavoice.

Acknowledgements

  • We thank meta-voice for opensourcing their code.

License

Apache License Version 2.0, January 2004

Downloads last month
81

Space using kotoba-tech/kotoba-speech-v0.1 1

Collection including kotoba-tech/kotoba-speech-v0.1