Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Audio Conditioned LipSync with Latent Diffusion Models
Text to Audio (Sound SFX) Generator
Image Super-resolution via Diffusion Inversion
Memory-Guided Diffusion for Expressive Talking Video Gen
3D/4D Scenes from a Single Image w/ Controllable Video Diff