Collections

Collections including paper arxiv:2402.01912

Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Paper • 2402.01912 • Published Feb 2, 2024 • 11

Parler-TTS: fully open-source high-quality TTS

To learn more about how these models were trained, or to fine-tune them yourself, check out the Parler-TTS repository on GitHub.

Parler-TTS

High-fidelity Text-To-Speech

parler-tts/parler-tts-mini-v1.1

Text-to-Speech • Updated Oct 30, 2024 • 2.77k • 13
parler-tts/parler-tts-large-v1

Text-to-Speech • Updated Nov 22, 2024 • 13.3k • 231
parler-tts/parler-tts-mini-v1

Text-to-Speech • Updated Nov 25, 2024 • 33.4k • 130

Open-source speech datasets annotated using Data-Speech

Open-source annotated speech datasets ranging from 1,000 hours to 45,000 hours.

parler-tts/mls_eng

Viewer • Updated Apr 9, 2024 • 10.8M • 25.3k • 14
parler-tts/libritts_r_filtered

Viewer • Updated Aug 6, 2024 • 359k • 1.69k • 12
parler-tts/mls-eng-speaker-descriptions

Viewer • Updated Aug 8, 2024 • 10.8M • 671 • 3
parler-tts/libritts-r-filtered-speaker-descriptions

Viewer • Updated Aug 8, 2024 • 359k • 447 • 3

Papers - Audio - TTS

Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

Paper • 1712.05884 • Published Dec 16, 2017 • 2
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

Paper • 2403.16973 • Published Mar 25, 2024 • 2
High Fidelity Neural Audio Compression

Paper • 2210.13438 • Published Oct 24, 2022 • 4
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

Paper • 2404.03204 • Published Apr 4, 2024 • 7

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Paper • 2402.07383 • Published Feb 12, 2024 • 13
Matcha-TTS: A fast TTS architecture with conditional flow matching

Paper • 2309.03199 • Published Sep 6, 2023 • 11
Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Paper • 2402.01912 • Published Feb 2, 2024 • 11
Fast Timing-Conditioned Latent Audio Diffusion

Paper • 2402.04825 • Published Feb 7, 2024 • 7
