Transcribe audio from a microphone, file, or YouTube link
Create a 3D model from an image in 10 seconds!
Enable the camera to start live vision
Generate text based on input prompts