# Features
* [x] Text-to-audio
* [x] Text-to-speech
* [x] [Bark](https://github.com/suno-ai/bark)
* [x] Speech generation
* [x] Voice cloning
* [x] Basic voice cloning
* [x] [Accurate voice cloning](https://github.com/gitmylo/bark-voice-cloning-HuBERT-quantizer)
* [x] Option to disable the stopping token, letting the AI decide how it wants to continue
* [x] [AudioLDM](https://github.com/haoheliu/AudioLDM) text-to-audio generation
* [x] [AudioCraft](https://github.com/facebookresearch/audiocraft) text-to-audio generation
* [x] Audio-to-audio
* [x] Bark audio-to-audio using [a custom quantizer](https://github.com/gitmylo/bark-voice-cloning-HuBERT-quantizer) to deconstruct audio for Bark input
* [x] [RVC](https://github.com/RVC-Project/Retrieval-based-voice-conversion-webui) (retrieval-based voice conversion)
* [x] RVC training
* [x] [coqui-ai/TTS](https://github.com/coqui-ai/TTS) text-to-speech
* [x] Automatic speech recognition
* [x] [Whisper](https://github.com/openai/whisper) speech recognition
* [x] [Extensions](extensions/index.md)
* [x] Python
* [x] JavaScript
* [x] Styling