metadata

tags:
  - text-to-image
  - stable-diffusion
  - audio-to-video
license: apache-2.0
language:
  - en
library_name: diffusers

V-Express Model Card

Project Page | Paper | Code

Introduction

model_ckpts/sd-vae-ft-mse. VAE encoder. (original model card stabilityai/sd-vae-ft-mse)
model_ckpts/stable-diffusion-v1-5. Only the model configuration file for unet is needed here. (original model card runwayml/stable-diffusion-v1-5)
model_ckpts/v-express. The video generation model conditional on audio and V-kps we call V-Express.
You should download and put all .bin model to model_ckpts/v-express directory, which includes audio_projection.bin, denoising_unet.bin, motion_module.bin, reference_net.bin, and v_kps_guider.bin.