README / README.md
reach-vb's picture
reach-vb HF staff
Update README.md (#2)
43ff20e verified
metadata
title: README
emoji: 
colorFrom: green
colorTo: gray
sdk: static
pinned: false

Parler-TTS

Parler-TTS is a lightweight text-to-speech (TTS) model that can generate high-quality, natural sounding speech in the style of a given speaker (gender, pitch, speaking style, etc). It is a reproduction of work from the paper Natural language guidance of high-fidelity text-to-speech with synthetic annotations by Dan Lyth and Simon King, from Stability AI and Edinburgh University respectively.

Contrary to other TTS models, Parler-TTS is a fully open-source release. All of the datasets, pre-processing, training code, and weights are released publicly under a permissive license, enabling the community to build on our work and develop their own powerful TTS models. It consists in:

Base v0.1 model & demo out!

🚨 Expressive v0.1 model & demo out! Try it out here 🤗!