tts - a heiscold Collection

updated about 3 hours ago

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Paper • 2402.07383 • Published Feb 12 • 13

Matcha-TTS: A fast TTS architecture with conditional flow matching

Paper • 2309.03199 • Published Sep 6, 2023 • 10

Natural language guidance of high-fidelity text-to-speech with synthetic annotations

Paper • 2402.01912 • Published Feb 2 • 7

Fast Timing-Conditioned Latent Audio Diffusion

Paper • 2402.04825 • Published Feb 7 • 7

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23 • 29

Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

Paper • 2406.02430 • Published Jun 4 • 27

LiveSpeech: Low-Latency Zero-shot Text-to-Speech via Autoregressive Modeling of Audio Discrete Codes

Paper • 2406.02897 • Published Jun 5 • 13

VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers

Paper • 2406.05370 • Published Jun 8 • 12

E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS

Paper • 2406.18009 • Published 26 days ago • 18

Towards Robust Speech Representation Learning for Thousands of Languages

Paper • 2407.00837 • Published 21 days ago • 9

Autoregressive Speech Synthesis without Vector Quantization

Paper • 2407.08551 • Published 11 days ago • 12

Stable Audio Open

Paper • 2407.14358 • Published 3 days ago • 6

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Paper • 2407.14329 • Published 3 days ago • 1