Interact with conversation-driven apps
Generate realistic audio from text
Generate talking-face animations from still images and audio