Collections
Collections including paper arxiv:2402.01912
- parler-tts/mls_eng
  Viewer • Updated • 10.8M • 9.14k • 14
- parler-tts/libritts_r_filtered
  Viewer • Updated • 359k • 1.22k • 11
- parler-tts/mls-eng-speaker-descriptions
  Viewer • Updated • 10.8M • 270 • 2
- parler-tts/libritts-r-filtered-speaker-descriptions
  Viewer • Updated • 359k • 174 • 3

- Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions
  Paper • 1712.05884 • Published • 2
- VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild
  Paper • 2403.16973 • Published • 2
- High Fidelity Neural Audio Compression
  Paper • 2210.13438 • Published • 3
- RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis
  Paper • 2404.03204 • Published • 7

- Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like
  Paper • 2402.07383 • Published • 13
- Matcha-TTS: A fast TTS architecture with conditional flow matching
  Paper • 2309.03199 • Published • 11
- Natural language guidance of high-fidelity text-to-speech with synthetic annotations
  Paper • 2402.01912 • Published • 11
- Fast Timing-Conditioned Latent Audio Diffusion
  Paper • 2402.04825 • Published • 7