Generate text from images and videos
Segment objects in images and videos using text prompts
Create detailed images from sketches and other inputs
Calculate depth and normal maps from images
Generate depth map from image