CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought Paper • 2409.19510 • Published Sep 29, 2024
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR Paper • 2406.06619 • Published Jun 7, 2024
FELLE: Autoregressive Speech Synthesis with Token-Wise Coarse-to-Fine Flow Matching Paper • 2502.11128 • Published 26 days ago