Create a 3D model from video or images
Generate descriptions from masked images
Generate audio from text and an optional audio prompt
ML-powered speech synthesis directly in your browser
Small reasoning model that runs locally in-browser
Self-Supervised Prompt Optimization
Decompose 3D shapes into parts with HoloPart
Embedding Leaderboard
Gemini native image for 3D co-drawing