Generate audio from text using VITS
Generate audio from text using voice synthesis
Generate and convert speech using text and audio inputs