PeriodGrad: Towards Pitch-Controllable Neural Vocoder Based on a Diffusion Probabilistic Model • Paper 2402.14692 • Published Feb 22, 2024
Release of Pre-Trained Models for the Japanese Language • Paper 2404.01657 • Published Apr 2, 2024
An Integration of Pre-Trained Speech and Language Models for End-to-End Speech Recognition • Paper 2312.03668 • Published Dec 6, 2023
Towards human-like spoken dialogue generation between AI agents from written dialogue • Paper 2310.01088 • Published Oct 2, 2023
Singing voice synthesis based on frame-level sequence-to-sequence models considering vocal timing deviation • Paper 2301.02262 • Published Jan 5, 2023