Generate and convert speech using text and audio inputs
Generate and convert audio using text or voice input
Analyze image to generate descriptive prompt