Stable Video Diffusion 1.1 TensorRT

This repository hosts the TensorRT version of the Stable Video Diffusion (SVD) 1.1 Image-to-Video model.

Model Details

Please see Stable Video Diffusion (SVD) 1.1 Image-to-Video for the full model details.

This model is intended for research purposes only and should not be used in any way that violates Stability AI's Acceptable Use Policy.

Performance

SVD-XT 1.1 (25 frames, 25 steps)

	A100 80GB PCI	A100 80GB SXM	H100 80GB PCI
VAE Encoder	66.70 ms	65.68 ms	49.07 ms
CLIP	105.41 ms	53.20 ms	91.32 ms
UNet x 25	30,367.73 ms	27,489.88 ms	19,102.98 ms
VAE Decoder	4,663.63 ms	4,544.12 ms	3,382.62 ms
Total E2E	35,258.38 ms	32,166.41 ms	22,644.73 ms

Usage Example

Clone TensorRT and this repo then launch NGC container

git clone https://github.com/rajeevsrao/TensorRT.git
cd TensorRT
git checkout release/svd

git lfs install 
git clone https://huggingface.co/stabilityai/stable-video-diffusion-img2vid-xt-1-1-tensorrt

docker run --rm -it --gpus all -v $PWD:/workspace nvcr.io/nvidia/pytorch:23.12-py3 /bin/bash

Install libraries and requirements

cd demo/Diffusion
python3 -m pip install --upgrade pip
pip3 install -r requirements.txt
python3 -m pip install --pre --upgrade --extra-index-url https://pypi.nvidia.com tensorrt

Authenticate with huggingface

huggingface-cli login

Perform TensorRT optimized inference:

python3 demo_img2vid.py \
    --version svd-xt-1.1 \
    --onnx-dir /workspace/stable-video-diffusion-img2vid-xt-1-1-tensorrt \
    --engine-dir engine-svd-xt-1-1 \
    --build-static-batch \
    --use-cuda-graph \
    --input-image https://www.hdcarwallpapers.com/walls/2018_chevrolet_camaro_zl1_nascar_race_car_2-HD.jpg

stabilityai
/

stable-video-diffusion-img2vid-xt-1-1-tensorrt

Stable Video Diffusion 1.1 TensorRT

Model Details

Performance

SVD-XT 1.1 (25 frames, 25 steps)

Usage Example

Space using stabilityai/stable-video-diffusion-img2vid-xt-1-1-tensorrt 1