Parler-TTS

Parler-TTS is a lightweight text-to-speech (TTS) model that can generate high-quality, natural sounding speech in the style of a given speaker (gender, pitch, speaking style, etc). It is a reproduction of work from the paper Natural language guidance of high-fidelity text-to-speech with synthetic annotations by Dan Lyth and Simon King, from Stability AI and Edinburgh University respectively.

Contrary to other TTS models, Parler-TTS is a fully open-source release. All of the datasets, pre-processing, training code, and weights are released publicly under a permissive license, enabling the community to build on our work and develop their own powerful TTS models. It consists in:

The Parler-TTS library for using and training high-quality TTS models.
The Data-Speech repository, for annotating speech characteristics in a large-scale setting.
This organization, that contains the released datasets and weights.

🚨 v0.1 model & demo out! Try it out here 🤗!

Parler TTS

AI & ML interests

Parler-TTS

Collections 2

Parler-TTS Mini

parler-tts/parler_tts_mini_v0.1

parler-tts/dac_44khZ_8kbps

google/flan-t5-base

Parler-TTS Mini

blabble-io/libritts_r

parler-tts/libritts_r_tags_tagged_10k_generated

parler-tts/mls_eng_10k

spaces 1

Parler-TTS Mini

models 2

parler-tts/parler_tts_mini_v0.1

parler-tts/dac_44khZ_8kbps

datasets 5

parler-tts/images

parler-tts/mls-eng-10k-tags_tagged_10k_generated

parler-tts/libritts_r_tags_tagged_10k_generated

parler-tts/mls_eng_10k

parler-tts/mls_eng

AI & ML interests

Team members 3

Parler-TTS

Collections 2

Parler-TTS Mini

Parler-TTS Mini

spaces 1

Parler-TTS Mini

models 2 Sort: Recently updated

datasets 5 Sort: Recently updated

models 2

datasets 5