Generate a two-speaker podcast from text input or documents
Detect objects in images or videos
Transcribe and translate audio into text