A Generalist Diffusion Model for Vision Perception
Music Generation - text to music, music continuation.
Generate anime character speech from text