Chat with model about images
Transcribe audio from microphone, file, or YouTube link
Generate a cartoon video from two images