Nix-TTS: Lightweight and End-to-End Text-to-Speech via Module-wise Distillation Paper • 2203.15643 • Published Mar 29, 2022
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale Paper • 2111.09296 • Published Nov 17, 2021
Audiobox: Unified Audio Generation with Natural Language Prompts Paper • 2312.15821 • Published Dec 25, 2023
Audiobox TTA-RAG: Improving Zero-Shot and Few-Shot Text-To-Audio with Retrieval-Augmented Generation Paper • 2411.05141 • Published Nov 7, 2024
Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model Paper • 2309.13018 • Published Sep 22, 2023