Generate talking face animation from still images and audio
Generate realistic voices from text
Transform video frames using text instructions