Generate clusters and visualizations from images
Transcribe audio or YouTube videos
Process video to analyze human motion