kotoba-tech/kotoba-speech-v0.1

Kotoba-Speech-v0.1

Kotoba-Speech v0.1 is a 1.2B-parameter Transformer-based generative speech model. It supports the following features:

Fluent text-to-speech generation in Japanese
One-shot voice cloning from a speech prompt

Usage

Please check out our HF Spaces demo.

Model Details

Model type: End-to-end Transformer.
Language(s): Japanese
Library: We'll release our training code soon. Inference and model code are largely adopted from metavoice.

Acknowledgements

We thank meta-voice for open-sourcing their code.

License

Apache License, Version 2.0, January 2004
